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Abstract 

^ C"| In a unified framework, we estimate the following quantities of interest in quantum information 

O |' theory: 



o 
o 



1. The minimum-error distinguishability of arbitrary ensembles of mixed quantum states. 



' 2. The approximate reversibility of quantum dynamics in terms of entanglement fidelity. 

^ ' (This is referred to as "channel-adapted quantum error recovery" when applied to the 

composition of an encoding operation and a noise channel.) 

3. The maximum overlap between a bipartite pure quantum state and a bipartite mixed 
state that may be achieved by applying a local quantum operation to one part of the 
^ ' mixed state. 

: 

qq 4. The conditional min-entropy of bipartite quantum states. 

CO ' 
CO 



A refined version of the author's techniques [J. Math. Phys. 50, 032016] for bounding the first 
quantity is employed to give two-sided estimates of the remaining three quantities. 

We obtain a closed-form approximate reversal channel. Using a state-dependent Kraus 
decomposition, our reversal may be interpreted as a quadratically-weighted version of that 
of Barnum and Knill [J. Math. Phys. 43, 2097]. The relationship between our reversal and 
Barnum and Knill's is therefore similar to the relationship between Holevo's asymptotically- 
optimal measurement [Theor. Probab. Appl. 23, 411] and the "pretty good" measurement 
" of Belavkin [Stochastics 1, 315] and Hausladen & Wootters [J. Mod. Optic. 41, 2385]. In 

rN , particular, we obtain relatively simple reversibility estimates without negative matrix powers at 

■ no cost in tightness of our bounds. Our recovery operation is found to significantly outperform 

the so-called "transpose channel" in the simple case of depolarizing noise acting on half of a 
maximally-entangled state. Furthermore, our overlap results allow the entangled input state 
and the output target state to differ, thus obtaining estimates in a somewhat more general 
setting. 

Using a result of Konig, Renner, and Schaffner [IEEE. Trans. Inf. Th. 55, 4337], our max- 
imum overlap estimate is used to bound the conditional min-entropy of arbitrary bipartite 
states. 

Our primary tool is "small angle" initialization of an abstract generalization of the it- 
erative schemes of Jezek-Rehacek-Fiurasek [Phys. Rev. A 65, 060301], Jezek-Fiurasek-Hradil 
[Phys. Rev. A 68, 012305], and Reimpell- Werner [Phys. Rev. Lett. 94, 080501]. The mono- 
tonicity result of Reimpell [Ph.D. Thesis, 2007] follows in greater generality. 
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1 Introduction 



This paper considers the following problem of relevance in quantum information theory: 

The maximum overlap problem: Let p,tcn be a positive semidefinite trace-class 
operator on K. ® H, and let Men be positive semidefinite bounded operator on £ (g) W, 
where %, JC, and C are separable Hilbert spaces. What is maximum overlap 

MO {hkh, Men) = sup Tr (M cn K^ c iwen)) , U) 

where the supremum is over all quantum operations 1Z from 1C to CI 
The maximum-overlap problem has the following important special cases: 

1. The minimum-error quantum detection problem [14]: 

If an unknown quantum state p k is randomly selected from given ensemble of such states, 
with what probability may the value of k be determined by a carefully-chosen quantum 
measurement? 

2. Approximate reversal of quantum dynamics [5 20]: 

Suppose that an arbitrary quantum operation A acts on a given quantum state p. How 
well may the action of A be reversed by application of a recovery channel 1Z, so as to 
preserve the entanglement of the original system with the environment? This problem 
is one of "channel-adapted quantum error recovery" when the operation A is of 
the form A = Mo £, where £ is an encoding operation designed to protect against a 
known noise process Af. 

3. Estimation of conditional min-/max-entropy of bipartite quantum states [21]: 

Let pab be a bipartite quantum mixed state. Estimate the conditional min-entropy 
H min (A\B) of A given B. 

Since all of these problems are believed to defy closed-form solution, the purpose of this paper is 
to provide estimates. In section 3 a refined proof of the two-sided "generalized Holevo-Curlander" 
bounds of [22, 23] for case 1 is given. This method is extended in sections 4 and 5 to yield simple 
two-sided estimates for cases 2-3 and for MO (p, M) in the case of rank-1 M. 

We briefly introduce each of the cases 1-3 before outlining our approach and surveying closely- 
related work. 

1.1 Minimum-error detection 

The minimum-error quantum detection problem was first studied in the 1960's in connection with 
the design of optical detectors [24], and it has since become of importance in quantum Shannon 
theory (for example [25-27]) and in the design of quantum algorithms [28-36]. A generalization to 
the theory of wave pattern recognition may be found in [37]. Various general upper and/or lower 
bounds on quantum distinguishability may be found in [14, 22, 23, 25, 32, 38-44]. 
The minimum-error quantum detection problem is precisely formulated by 

Definition 1 Let 

£ = iPk} keK (2) 

be an ensemble of quantum states, represented as positive semidefinite operators normalized by 
a-priori probability, setting 

Tr p k =p k , (3) 
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where p k is the likelihood that p k will be drawn from £ . A quantum measurement [45] is described 
by a positive-operator-valued measure (POVM), which consists of a vector M = {M k } keK 

of positive semidefinite operators satisfying ^^.A/fc < lJ 105 l (Throughout this paper the operator 
inequality A < B means B — A is positive semidefinite.) The probability that the value k is measured 
when M is applied to a unit-trace density matrix p is given by 

Pr M (k\p)=TrM k p. 

The success rate for the POVM M to correctly determine the value ofk corresponding to a random 
element of the ensemble £ is given by 

Psucc (M) = Pk PrM (fc I — ) = Tr ^ M k p k . (4) 
k \ Pk/ keK 

The minimum-error measurement problem consists of finding a POVM maximizing (4)- 

1.1.1 The relationship to "worst-case" detection 

Sometimes one is interested in the "worst-case" distinguishability 

max min Tr > M k p k , (5) 

M k Z — ' 

of a collection of unit-trace states p k . As pointed out in [46] , the minimax theorem [47] implies that 
maxminTr (M k p k ) = maxminTry^M fe p fc/ 5 fc = minmaxTry^M fc p fe/ 5 fc , (6) 

M k r M { Pk } ^ r { Pk } M ^ 

where {pk} represents a probability distribution. In particular, single-instance bounds (for fixed 
{pk}) may in principle be minimized over all distributions {pk} to give corresponding "worst-case" 
bounds. 

1.2 Channel-adapted quantum error recovery 

The following problem is of importance in quantum information theory, quantum communication, 
and quantum computing: 

Suppose that one wishes to store, process, or transmit quantum data using a process that 
is subject to noise or loss. How well may the effects of this noise be avoided, corrected, 
or eliminated by encoding the data into a protected form, from which it may be later 
recovered unharmed by this noise? 

This problem arises in any physical implementation of quantum communication or computation, 
since unmitigated interactions with the environment tend to corrupt quantum signals or memory. 
By the celebrated "threshold theorem" [48-52], one may in principle use error correction and con- 
catenated quantum codes to perform an arbitrary quantum computation in the presence of noise 
below a fixed "threshold" amount. 

Standard quantum error correction seeks to design encoding and decoding maps which exactly 
correct for a given class of errors. Early successes of this program were the first codes that could 
protect against arbitrary singlc-qubit errors [53-55], followed by general theoretical advances of 
[56], and by the construction of codes that correct for arbitrary singlc-qubit errors by encoding a 
single qubit into five [57, 58]. 

Alternatively, one may consider approximate quantum error correction. For example, Leung 
et al [59] consider relaxed error correction criteria to allow for efficient correction of a known 
dominant noise process. Furthermore, Crepeau, Gottesman, and Smith [60] construct approximate 
error correcting codes which asymptotically correct twice as many arbitrary local errors as would 
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be possible under exact error correction, even though they achieve fidelity exponentially close to 1 
in the limit of long codes. 

Under the banner of approximate channel adapted error correction, a number of authors [7-20] 
alternatively have sought to treat quantum encoding and/or recovery as optimization problems. 
Mathematically, given a "noise" channel N one seeks an encoding operation £ and a recovery 
operation 1Z so that the composition 

3 = KoNo£ 

is as close to the identity channel as possible. Measures of "closeness" to the identity include 

Definition 2 Let p be a mixed quantum state over a Hilbert space rl, which may be represented 
as a pure quantum state \i^ P )fj £ of the original system entangled with an environment £. The 
entanglement fidelity [61] of the quantum operation S : B 1 (TL) — > B 1 (H) is given by 

F e (p,E) = (^ p \E(\^ p )^ P \)H P ). (7) 

(Note that the choice of purification does not affect the defined quantity.) The channel fidelity 
is the entanglement fidelity when p is taken to be maximally-mixed. Given a collection of states pk 
with a-priori probabilities p^, one defines the average entanglement fidelity [H] 

Fe ({(pk,Pk)} , S) = ^TPkFe (Pk, S) • (8) 

Following [8, 11, 14, 16], we shall fix the encoding operation £ and the noise process N. In 
particular, we focus on the problem of finding an approximately optimal quantum recovery map, 
or channel reversal, for the composed map 

in the sense of entanglement fidelity. 

1.2.1 Other metrics for error recovery 

A number of works have considered other measures of reversibility of quantum channels. Kretschmann, 
Schlingcrmann, and Werner [62] have obtained two-sided bounds on the CB-norm reversibility of 
channels in terms of the CB-distance between the complementary channel and a depolarizing chan- 
nel. Ng and Mandayam [20] have employed the transpose channel (a special case of Barnum 
and Knill's [14] reversal) to study quantum error correction using the metric of worst-case (non- 
entanglement) fidelity. Yamamoto, Hara, and Tsumura [17] considered a fixed encoding operation £ 
and used semidefinite programing to find a sub-optimal channel 1Z to roughly optimize the "worst- 
case" entanglement fidelity 

maxminFe (p, TZ o Mo £ ) . (9) 

TZ p 

More will be said about worst-case bounds in section 1.5.4, below. 
1.3 Quantum conditional min- and max-entropy 

The following related quantities (and their e-smooth counterparts) are of interest in quantum cryp- 
tography (for example [21, 63-68]) and/or in studies of non- identically distributed and/or non- 
asymptotic problems in quantum information theory (for example [69-73]): 

Definition 3 Let pab be a bipartite density operator on T-La ®T-Lb- The min-entropy of A 
conditioned on B [21, 70] is defined by 

H mm (A\B) :=-log 2 inf{Tru B I Pab < l A ®v B }, (10) 

f V B 

where the infimum ranges over positive semidefinite v B - The max-entropy of A conditioned 
on B [21, 70] is defined by 

H max (A\B) p := -H mm [A\C) p , (11) 
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where the min-entropy on the RHS is evaluated for a purification pabc °f PAB- The max- 
information [73] that B has about A is given by 

I-max {A : B) p = H min (A\B) -1/2 pAB -1/2 , 

where pa = Tr B Pab ■ 

Estimates of H min (A\B) p arc obtained as a corollary of our estimates for maximum overlap in 
conjunction with the following recent theorem: 

Theorem 4 (Konig, Renner, Schaffner [70]) Let the Hilbert spaces rl a andTiB be finite- dimen- 
sional. Then the min-entropy of A conditioned on B for the state pab may be expressed as 

H mm (A\B) p = - log (dim(H A ) sup (($aa>\Kb^A' (pab) l^AA'))^ , (12) 

where <&AA' is a bipartite maximally- entangled state between A and reference system A' ~ A, and 
where the supremum is over quantum operations from B to A' . 

1.4 Directional Iterates: An abstract approach for deriving estimates 

The first step in proving our estimates will be to recast all of the problems of the first section as 
instances of 

Problem 5 (Maximal seminorm problem) Let S be a subset of a real or complex semidefinite 
inner product space V. Find a maximal- seminorm element of S . (A semidefinite inner product has 
all the usual properties of an inner product, except that one may have (x, x) = for nonzero x.) 

The following generalization of the iterative schemes of [6, 7, 10, 74], will prove useful for 
analyzing this class of problems: 

Definition 6 An abstract Jezek-Rehdcek-Fiurdsek-Hradil-Reimpell-Werner iterate of g £ 

V is an element g( + ) £ S which maximizes R,e(g( + \g}. Such g( + ) will also be called directional 
iterates. 




Fig la: Iterates converging on a; ma x- Fig lb: Note that 

(contours drawn orthogonal to G.) ||^max|| > ||G^ + ^|| > ||lTG ( - + - ) || > ||IIX max || , 

with approximate equality for reasonably small 9. 
Here II = \G) (G\ , and V has real scalars. A 
complex variant appears as inequality 13. 



G 



Useful properties of these iterates are given by 



Lemma 7 (Geometric properties of directional iterates) Suppose that S C V has a maximal- 
seminorm vector x max , and assume that each g € V admits a directional iterate g^ + ' . Then 



Gl. One has the following inequalities 



IK 



> 



,(+) 



> A (g) > \\x 

II cos (#) 



where 



{9 1 ^max) 



cos 9 :~ Re 



Ml IK 



G2. XTie map g ^ g^' is seminorm-increasing on S. In particular, if g € S then 



,(+) 



>yr + 



,9 (+) -.9 



(13) 

(14) 
(15) 

(16) 



Note: The importance of property Gl is this: If one can construct a guess g subtending a reasonably 
small angle with a; max then both A(g) and ||g^|| are reasonably good estimates for 1 1 a^max 1 1 - (Note 
that although || is a closer approximation to ||x max ||, in our applications A (g) will have a 

much simpler expression.) 

Proof. To prove property Gl, note that 



l^maxll — 



,(+) 



>^f^>^%^ = l|w||cos W 

ll.'/ll IMI 



The first inequality is trivial, the second is Schwarz's, and the third is by the definition of g( + > . 
To prove property G2, write 



,(+) 



+ 



|. 9 !| 2 + 2Rc((. 9 (+), 5 )-( 5 , 5 )) 



The last term on the RHS is nonnegative by the definition of g( + \ 
We now may set forth the following: 



General strategy for estimating maximal seminorms: 

1. Find a "small angle guess" g, such that the angle defined by (15) is provably small in 
some approximate sense. 

2. Obtain two-sided bounds for ||a; m ax|| using this bound on 8 in conjunction with (13). 

3. Make this bound explicit by computing g^ and A(g). 



By property G2 of Lemma 7, one may have some hope of obtaining a maximal element as the 
limit of repeated iteration, as occurs in Fig. la. In sections 1.4.1-1.4.2 we review numerical schemes 
in the literature which may be seen as examples of this process. (These sections may be skimmed 
on first reading.) 
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1.4.1 Example 1: Jezek- Rehacek-Fiurasek iteration for POVMs 

Jezek, Rehacek, and Fiurasek (JRF) [74, 75] proposed an unproven numerical method for computing 
optimal POVMs, I 106 ' using iteration of the mapping M \— > given by 

Definition 8 The Jezek-Rehdcek-Fiurdsek (JRF) iterate of a POVM M = {M k } keK [74, 
75] is the POVM defined by 



1/2+ / v -1/2+ 



r (e) 



M r = I ^PtM m J p k M kPk I ^ P MW J 

\ieK ) \£ei< J 



Here the negative matrix power is defined by 

Aj>0 

for s > and self-adjoint A with spectral decomposition A — y^Ajllj ■ 

Jezek, Rehacek, and Fiurasek made the following: 
Numerical Observation 9 (JRF [74, 75]) JRF iteration monotonically increases success rate: 



p 

1 SI 



Af<®>) >Psucc(M). 



Furthermore, iteration of this map starting from {M^ = 1} converges to an optimal measurement 



lim Psucc 



(M {<s Y)=P succ (M°P t ). (19) 



In section 3.1, JRF iteration is exhibited as a disguised form of directional iteration. JRF's 
numerically-observed monotonicity then follows immediately from property G2 of lemma 7. 

1.4.2 Example 2: Jezek-Fiurasek-Hradil and Reimpell- Werner iterates 

Jezek, Fiurasek, and Hradil (JFH) [6, 75] proposed an unproven numerical scheme for the maximum- 
likelihood problem [75-78] in quantum process tomography, which contains the maximum-overlap 
problem (1) as a special case. I 107 ' 

Reimpell and Werner [7, 10] introduced a mild generalization of this special case of JRH's 
algorithm, for use in finding maximizers of the following: 

Definition 10 A Reimpell- Werner functional [7, 10] 1Z n- f (11) is a linear functional of 
linear transformations 1Z : B 1 (tC) — > B 1 (C) such that f (1Z) > for all completely positive 1Z. 

Reimpell and Werner were interested in the special cases of approximate quantum error recovery 
and quantum encoding in the sense of channel fidelity. In particular, setting 

f A r(£,Tl) = F e (t/dimH,lZoMo£), (20) 

where Af is a known noise map, they alternatively optimized the encoder £ and decoder 1Z in a 
seesaw fashion. 

By analogy with the matrix-power method [7, 10], they proposed an unproven numerical method 
for maximizing / (1Z) by iteration of the following map: 

Definition 11 Let C and K, be finite- dimensional, and represent the Reimpell- Werner functional 
f as 

f(K)=T±(FK), (21) 
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where 7Z € B 1 (C ® K*) is the Choi matrix of 1Z (see Definition 27) and F is a positive operator on 
CK* . The Reimpell- Werner iterate 1Z® oflZ [7, 10] is the quantum operation with Choi matrix 

TZ® =T- 1 / 2+ FRFT- 1 /^, (22) 

where Y : CK,* — > CJC* is given by 

r = l c ® Tr (FRF^J . (23) 

Reimpell [10] proved the monotonicity property / (TZ®) > f (1Z) using a clever matrix analysis 
argument. In particular, the optimal map 72. is a fixed point of this iteration. 

In Appendix B we show that Reimpell- Werner iteration (and the special case of restricted JRH 
iteration) may be viewed as directional iteration on the corresponding space of Stinespring dilations. 
In particular, Reimpell's monotonicity result is exhibited as a special case of Lemma 7. 



1.5 Relevant existing bounds, suboptimal measurements, and approxi- 
mate reversals 

1.5.1 Quadratic measurements and Generalized Hole vo- Cur lander bounds 

Definition 12 Let £ = {pk \ipk) (V'fcllfceA: be an ensemble of pure states. Then Holevo's pure 
state measurement [43] is given by M k = \ek) (&k\> where 

e k = e™™° := (jj> 2 k l^fc) (^l)^ Pki>k- (24) 
Holevo constructed this measurement using an approximate minimal principle, and proved 

Theorem 13 (Holevo's asymptotic optimality theorem [43]) Holevo's measurement is 
asymptotically- optimal for distinguishing pure states in the sense that for fixed probabilities {pk} 
one has 

Pfaa({e^°}) ^ 1 

Yjoptimal ^ ' 

*fail 

as the ipk are varied so that (ipi, ipj) — > 8ij. Here Pf a u = 1 — Psucc represents the failure rate. 

A natural mixed-state generalization of Holevo's measurement is given by 

Definition 14 The quadratically- weighted measurement [22, 74] for distinguishing the en- 
semble (2) is the first Jezek-Rehdcek-Fiurdsek iterate 

Mr=(E^ 2 )" 1/2+ ^(E/?)" /2+ - ^ 

Remark: The quadratically-weighted measurement is an example of a Bclavkin-Maslov measure- 
ment (see page 39 of [37]). 

Generalizing the pure-state results of Holevo [43] and Curlander [44], the author proved the 
following: 

Theorem 15 (Generalized Holevo-Curlander bounds [22]) One has the following bounds on 
the success rate of the optimal measurement M opt for distinguishing the ensemble £ of Definition 
1: 

A 2 < P succ (M QW ) < P succ (M°f f ) < A, (27) 

where 



A = Tr vE^^ L ( 28 ) 

Note: The upper bound of (27) was essentially a special case of a pre-existing bound of Ogawa 
and Nagaoka [23] , which is a simple consequence of matrix monotonicity. 
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1.5.2 The "pretty good" measurement and Barnum & Knill's distinguishability bound 

Another approximately-optimal measurement is the linearly-weighted measurement given by 

Definition 16 The Belavkin-Hausladen-Wootters "pretty good" measurement (PGM) 

[79-82] is given by 

M k PGM = (J>) " /2+ Pk (j>) " /2+ • (29) 

A comparison of the PGM with Holevo's pure state measurement was conducted in [83]. It was 
found that Holevo's measurement outperforms the PGM for ensembles of two pure states, and that 
the PGM does NOT satisfy Holevo's asymptotic optimality property (25). 

The PGM is approximately-optimal for "reasonably-distinguishable" ensembles in the following 
precise sense: 

Theorem 17 (Barnum-Knill [14]) The success rate of the PGM satisfies 



Psucc (M°r*) 
where M opt is an optimal measurement. 



M opt ) , (30) 



Re-expressing this inequality in terms of Pf a ii = 1 — Psucc, one sees that the PGM has a failure 
rate within a factor of two of the optimal: 

Pfail (M opt ) < P fail (M PGM ) < (1 + P succ (M opt )) P fail (M opt ) < 2 x P fail (A/ opt ) . (31) 

The relationship between Barnum and Knill's bound (30) and the bounds of Theorem 15 is explained 
by the following proposition: 

Proposition 18 (Comparison with the Barnum-Knill bounds) Both of the lower bounds of 
inequality 27 are sufficiently tight to also satisfy Barnum and KnilVs tightness relation (30): 

Psucc (M^ A 2 

Psucc (Mo P t) - Psucc {M o Pt) * Psucc [M ) . W 

In particular, P faU (M QW ) , 2 (1 - A), and 1-A 2 all lie in the interval [P fa , a (M opt ) , 2 x P fail (M opt )]. 

Proof. Equation 32 follows immediately by double application of inequality 27. The claimed 
inclusions follow as in inequality 31, where one additionally uses the inequality 1 — A 2 < 2 (1 — A). 
■ 

1.5.3 Barnum and Knill's approximate reversal map 

Generalizing the "pretty good" measurement I 108 ', Barnum and Knill have constructed a reversal of 
an arbitrary quantum operation A : B 1 (H) — > B 1 (JC) that is approximately optimal for reasonably 
reversible A in a precise sense: 

Theorem 19 (Barnum-Knill [14]) Assume that the density operators pu <E B 1 (rl) of equation 
8 commute, set p = J^PkPk, and let A^ : B (/C) — > B (H) be the adjoint of A (see Def. 24, below). 
Then the recovery operation 

K BK (v) = VM f (P)r 1,2+ v {A (p)y 1/2+ ) y/p (33) 
is approximately optimal in the sense that 

F e ({ Pk ,p k },K BK oN) 

■=—- r — — m > m&xF e ({p k ,p k } ,KoAf ), 34 

m&yL n F e ({p k ,p k \ ,1lo N) n 

where F e is the average entanglement fidelity of equation 8. 
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A special case of eq. 33 is of recent [20] interest in the literature: 

Definition 20 The transpose channel [84] is the special case of the Barnum-Knill reversal 7Z BK 
for maximally-mixed p. 

A reversal of approximately optimal entanglement fidelity which is closely related to the quadratic 
measurement will be constructed in section 5. 

1.5.4 The bounds of Beny and Oreshkov 

Generalizing the problem of quantum error-recovery, Beny and Oreshkov [18] have more-generally 
considered channel simulation. In particular, they consider the "worst-case" entanglement fidelity 

max min F p (1ZA, M ) 
with which the channel A may be used to simulate the channel M. Here one has 
F p (Af,M) = min / (j\f-H->K 

hh r faM) >M-h-*k (\^ p ) HHr (Vvl)) ' 

where ip p is a purification of p and (changing their conventions slightly) / (p, a) = (Tr ' ^fpa \J~p) 2 
is the fidelity between the states p and a. Note that quantum error recovery is the M. = 1 special 
case. Employing the min-max Theorem and a beautiful (and short!) duality argument involving 
complementary channels, they obtain the following theorem: 

Theorem 21 (Beny-Oreshkov [18]) One has the worst-case recovery bounds 

3 1 ~ 



- + > maxminJe (p, 1loA) > hi, (35) 



where the state a is an adjustable parameter, A has Kraus decomposition A (p.) = ^^EjfiEj , and 



p 

Furthermore, if p is fixed then one obtains 
3 1 



A a := minTr \ J2 Ei P 2E J x Tr { E o aE i ) ■ ( 36 ) 



4 F p [A, S) > max F e (p, TZoA)> [F p {A, S) ) , (37) 

where A is a channel complementary to A and S (a) = A(p) x Tr a. 
Remarks: 

1. There is an apparent, but unexplained, relationship between our work below and the results 
of Beny-Oreshkov, which appeared in arXiv preprint form almost-simultaneously to ours. 
Further will be said on this matter in [19]. (See Theorem 44 and Proposition 45, below.) 

2. It is important to note that in the finite-dimensional case that one has the identity 

max min F e (p, 1Z o M o £) = min max F e {p, 1Z o Af o £) . (38) 

1Z p p 1Z 

This follows from the min-max Theorem [85], where the convexity of the mapping p i-> 
F e (p, 7Z o J\f o £) is evident from equation 1.10 of [15] and where one may take the recovery 1Z 
to range over the convex set of quantum operations (trace non-increasing completely positive 
maps). In particular, one may obtain "worst-case" recovery bounds (albeit with unevaluated 
minimization over p) from single-instance bounds on F e (p,lZ o J\f) , which we exclusively 
consider below. 
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1.6 Results 



Section 1.4 has already introduced directional iteration as an abstract method for estimating solu- 
tions of maximal-seminorm problems. This incorporates several explicitly-defined numerical itera- 
tive schemes, including: 

• The iteration of Jezek, Rehacek, and Fiurasek for computing optimal quantum measurements. 

• The iteration of M. Jezek, J. Fiurasek, and Z. Hradil as restricted to the maximum-overlap 
problem. 

• The iteration of Reimpell and Werner for numerically optimizing quantum error correction 
(both encoding and recovery). 

Defined by a minimal-principle, directional iteration monotonically increases seminorm essentially 
by construction. In particular: 

• Jezek, Rehacek, and Fiurasek's numerical observation that their iteration only increases suc- 
cess rate is proven in greater generality 

• This gives a short proof of Reimpell's monotonicity Theorem (pp. 39-42 of [10]) for iterative 
optimization of quantum error correction. 

Section 3 introduces our techniques by presenting a new proof of the generalized Holevo- 
Curlander bounds (Theorem 15) on the distinguishability of arbitrary ensembles of mixed quantum 
states. 

In section 4, Theorem 39 gives concise two-sided bounds for the maximum overlap problem (1), 
in the restricted case that Men is rank 1. Corollary 40 bounds the quantum conditional min- 
cntropy. Appendix C shows how one may apply these bounds to recover the bounds of section 
3. 

Theorem 44 of section 5 applies our overlap bounds to estimate approximate channel reversibility 
in the sense of entanglement fidelity. (The bounds of section 4.2.5 more generally allow the entangled 
input and output states to differ, however.) Our channel-reversibility estimates apply to the case 
of channel-adapted approximate quantum error recovery. 

Section 5.3 compares our reversibility estimates and approximate reversal map to those of Bar- 
num and Knill. Although our bounds take a particularly simple form, they are still sufficiently 
accurate to satisfy the tightness relation (34) satisfied by the bounds of Barnum and Knill. The 
relationship between our recovery map and Barnum and Knill's is found to be analogous to the 
relationship between Holcvo's asymptotically optimal measurement and the so-called "pretty good" 
measurement. Furthermore, our recovery operation is found to significantly outperform the trans- 
pose channel in the case of depolarizing noise acting on half of a maximally-entangled state. 

The conclusion points out directions for future research. 

2 Notation, conventions, and mathematical background 

The reader who is only interested in minimum-error distinguishability bounds should proceed di- 
rectly to section 3, referring back only as directed. 

Definition 22 Let T-L and K. be Hilbert spaces, and let A : % — > K, be a bounded linear operator. 
The absolute value is \ A\ = v A. The space B 1 (H — > JC) consists of all operators of finite trace 
norm \\A\\ l := Tr \ A\. The space B 2 {T-L — > IC) consists of all operators of finite Hilbert- Schmidt 
norm \\A\\ 2 := VTrATK. This space has the inner product 

(A,B)=TtA^B. (39) 
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The space B (H — > JC) consists of all operators of finite operator norm, given by 

\\M = \\A\L= sup J^f. (40) 
o^4>€U \\W\\ 

When TL = IC, these spaces will be denoted by B (%), B 1 (W), and B 2 (%) , for short. An operator 
A is a contraction if \\A\\ < 1. 

It is assumed that the reader is familiar with the following trace-norm inequalities, which may 
be found in [861: 



|TrA| < = 11^^ \IK = U. (41) 

WWAW^WWW^^WA^. (42) 

Furthermore, 

sup Re(TrA t t/) = \A\ X , (43) 
\\u\\<\ 

where A : % — > IC and the supremum is over contractions U : H — > IC. It follows simply from the 
singular value decomposition that U is a maximizer of (43) iff 

U\nM^A)=A(AUy 1/2+ , (44) 



+ 



where (A^A) 1/2 is defined by (18). 

Definition 23 Let A be a self-adjoint operator. The positive projection H + (A) is the projection 
onto the closure of the range of the positive part of A. In particular, if A has spectral decomposition 
A = \ipi) (ipi\ then 

U + (A)=J2\Vi)(Vi\- (45) 

Ai>0 

A more thorough discussion of most of the following terms may be found in [45] : 

Definition 24 A quantum state is a trace-class positive semidefinite operator p on a Hilbert 
space %. (Generally states are of unit trace, although in section 3 it will be convenient to normalize 
them by a-priori probability.) The support supp (A) of the transformation A : % — > K, is the 
closure of the range of A' A, or equivalently the orthogonal complement of the null-space of A. A 
quantum channel is a trace preserving completely positive map. A quantum operation is a 
trace non-increasing completely positive map. A linear operator U : IC — > C <8> £ is a Stinespring 
dilation [87] of a completely positive map 7Z : B 1 (IC) — > B 1 (£) if 

K (p) = Tr (UpZ7+) (46) 
for all p £ B 1 (IC). The adjoint TZ' : B (C) —> B (IC) has the defining property that 



Tr (X c K K ^c (Yk)) = Tr (K^ K (X c ) Y K j (47) 

forXeB (C) and Y g B 1 (fC) . 

It is important to note that if 1Z and U are related by (46) then 7Z is a channel iff U is an 
isomctry (U'U = l), and 1Z : B 1 (IC) —> B 1 (C) is a quantum operation iff U is a contraction. 
Furthermore, it is observed in Appendix A that each quantum operation A has a canonical dilation 
with the canonical environment 

£ = £* £ ®tC £ , (48) 
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where K.£ and C* £ are copies of JC and the dual space of C, respectively. 



Tensor product notation: A linear operator will often be denoted as A-u^k, and 

will be identified without further comment with any operator of the form A <8> fl.£ where l£ is the 
identity operator on some other Hilbert space C. If \ip) € C, the transformation \ip)®A : TL — > JC®C 
is defined by 

{\*) c ®A n ^ K )\4>) H = \1>) c ®\M)K ( 49 ) 
When A : B 1 (TL) — > B 1 (/C) is a quantum operation, it will often be denoted as A-h^k- 

2.1 Basis-free constructions using dual spaces and double kets 

As in a number of previous works on channel-adapted quantum error recovery [15-17], in Section 5 
and in the Appendices A and B it will prove convenient to treat Hilbert spaces and their duals on 
equal footing: 

Definition 25 The dual space TL* of the Hilbert space TL is the set of linear Junctionals tp : TL — > C 
of the form 

$(<j>) :=(lM) -n^C, (50) 
where ip £ TL. The space TL* is a Hilbert space in its own right with inner product 



(lpl,^2) H , ■■= {(lpUlp2) H ) = {lp2, 1pl) n , (51) 

where the bar in the middle denotes complex conjugation. 

Use of the dual space as a Hilbert space in its own right has pleasant computational properties 
which are amenable to Dirac notation. I 109 ! For example, if ip £ TL has the coordinate expansion 

V> = 5> I*')* ( 52 ) 
then the dual vector ip € TL* has the expansion 

$ = ^j4>% \i)w > ( 53 ) 
where the coordinates ipi := W)-u.* arc simply the complex conjugates of the coordinates ipf. 



1>i = (54) 
Given the linear transformation A £ B 2 (TL — > JC) 

A n ^ K =^Aij\i) K {j\ n (55) 

one may form the conjugate operator A : TL* — > JC* , the transpose A tr : JC* — > TL* , and the 
basis-free double ket \A)) Kn , <E JC (8 TL* by 

Ah*^k* = J2 A v \*)k' G\w (56) 

4e r .->«- = J2 A v \3)h* (57) 

I^W = E A ^' l tf >ic lJ>w. ■ (58) 

These equations may be replaced by basis-independent definitions, since they are uniquely-specified 
by the identities A<p = (A<j>), A tr = A\ and (ipK,\ {4>u* \ \A)) KH , = for ^ e ^ and ^ e ^> 

respectively. 

The basis-free double bra 

(Mi™- =E^(^ ( 59 ) 
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denotes the linear functional on K. ® T~L* corresponding to \A)). The partial transpose is the 
isometric extension of the mapping A-^^k ® Bc^m ^ ^ac-^-h* ® Bc^m, i-e. 

^ PT 2 (Y.XmMU \m M ) 1%) (hn\ (tc\) = Y.X mkM \m M ) \hu») (te. \ (lc\ , (60) 

which maps B 2 (% ® C ^ K ® M ) -> B 2 {JC* ®C^U* ® M), where £ and .M are arbitrary 
Hilbert spaces. 

We collect some useful identities involving basis-free double-kets: 
Lemma 26 J. // A,B :U -> K then 

iA,B)) Kn ,=TvA^B (61) 

2. Let A : K. —> £, B : H —> M, and C :H —> }C. Then 

(A K ^ C ® B H ^ M .) \C)) KW = \ACB^)) CM , . (62) 

3. Let A : H -> C and B : K -> C. Then 



U\ cu> >< \ B ))ck* = Tr \B)) CK , ((A\ cu , = B*A (63) 

I Let A : U -> JC and B : U -> C. Then 

ll\A)) KH , {{B\ cn ,=A n ^ K B^ H (64) 
b H u^b H u) {IA))ich ' 1 1 B \ch*) = B^u®A h ^k:U®C^U®1C (65) 
Note that by multilinearity it is enough to check these identities for rank-1 operators. 
Definition 27 The canonical purification [110] of a quantum state p £ B 1 (]H) is given by 

I^p) = IVP>W ■ ( 66 ) 

When K, is finite-dimensional^ 111 ^ the Choi matrix [88] of a transformation 1Z : B 1 (/C) — > B 1 (C) 
is given by 

n = n{\t,) K ^{{i\)£B l {CK*)- (67) 

Note that by (64) and (63), i[ p has the standard defining property 

p = Tr \%) Q, p \ (68) 

ri 



of a purification of p, and also of p 



p = Tr|Vv)(^p|. (69) 

ri 

— 1/2 

In particular, if *H is finitc-dimcnsional then the state (dim'H) is maximally-entangled, 

and indeed the singular value decomposition of an operator 

A = $> \fi)(9i\ 
corresponds precisely to the Schmidt decomposition of its doublc-kct 

Remark: A basis-free construction of the Stinespring dilation may be found in Appendix A. 
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3 Minimum-error distinction as a maximal seminorm prob- 
lem 



The minimum-error quantum detection problem of Definition 1 may be reformulated as a maximal- 
seminorm problem using the identity 

Psucc (M) = \\Ef £ , (70) 

per the following definition: 

Definition 28 Let £ = {pk}keK ^ e ^ e ensemble of Definition 1. A vector of operators E = 
{Ek : H — > T~L}keK * s a generalized measurement (GM) [24] corresponding to the POVM M = 
{7\/fe} fcgA - if one has the decomposition 

M k = ElE k . (71) 

The £ -semi-inner product is defined for vectors of operators F — {Fk : T~i — ^ H-\keK an d G — 
{Gk : % -> %} keK by 

(F,G) £ = Tr J2 F lGkPk- (72) 

keK 

The £-semi-inner product space is the space V £ = {E \ \\E\\ £ < oo}, on which (»,») £ is well- 
defined. The set Ss C Vs will denote the set of generalized measurements of £ . 

Remark: It is important to note that if £ is a perfectly distinguishable ensemble of more than 
one element then \\*\\ £ is only a seminorm. In particular, any cyclic permutation E' of a perfectly- 
distinguishing generalized measurement must satisfy ||£"|| =0. 

3.1 Computation of directional iterates 

Our first step is to compute directional iterates for generalized measurements: 

Theorem 29 (Directional iteration for generalized measurements) Take S = Ss and V = 
Vs as in Def. 6. Then a directional iterate of E £ Vs is given by 

4 +) = E kPk (j2pt E l E tpt) > ( 73 ) 

where the exponent is given by equation 18. Furthermore, one has the identity 

(e(+\e) £ = Tv^JY,P* M ^- ( 74 ) 

Remark: It follows by comparison of eq. (73) with eq. (17) that the iteration E \— > E( + ' for 
GMs corresponds to Jezek, Rehdcek, and Fiurdsek's iteration M \— > M® for POVMs. In particular, 
Psucc (AT®) > -Psucc {M), as was observed numerically in [74]. 

Proof. The proof is an easy modification of that of Theorem 9 of [22], which employs the E^ = 1 
special case of (73). One has the identity 

Re{E,F) £ = ReTrV^Up, (75) 
where V E , U F : H -> H <g> C K are defined by 

Vsip = ^2 ( E kPki>) ® \k) C K 

keK 

U F ip = (Fkip) ® \k) C K , 
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where \k) CK is the standard basis of C K . Then F is a generalized measurement iff Uf is a contrac- 
tion, with \\Uf\\ < 1- But a contraction Uf maximizing (75) is computed using equation 44 

Equations (73) and (74) follow. ■ 



3.2 A "small-angle" guess 

In order to use Lemma 7 to prove distinguishability bounds, one must construct a guess G sub- 
tending a provably-small angle with an optimal generalized measurement E opt . As a hint of how 
to proceed, consider the case that the ensemble £ = £ PD is a perfectly-distinguishable ensemble, 
consisting of states pk of mutually-orthogonal support. An optimal GM is simply given by 

eT = n+ ( Pk ) , (76) 

where the positive projection on the right was defined in (45). Use of spectral theory may be avoided, 
however, if one notes that the semi-inner product (E,F) £ of equation 72 is sensitive to the action 
of the Ek and F^ only on the ranges of the corresponding pk- In particular, the simplest-possible 
"guess" 

G k = t for all k (77) 

satisfies G = E° pt mod (•, ») £ , since 

ll^° Pt - G Cpo = Tr]T (H+ (p k ) - tf Pk = 0. (78) 

This equation suggests that the guess (77) will remain appropriate for "reasonably distinguish- 
able" ensembles. Indeed, equation 73 shows that the iterate G^ + > corresponds to the mixed-state 
generalization of Holevo's asymptotically optimal measurement (24)J 112 1 Furthermore, one obtains 
the following "small angle" estimates: 

Lemma 30 Define G G Vs by equation 77, and let M be a POVM of non-zero success rate. Then 
one can decompose AI k = Ej,Ek in such a way that (G,E) £ € R and 

cos (9) := pyljj > VPsucc(M). (79) 

Proof. Chose M k — ElE^ arbitrarily. By the polar decomposition, there exist unitary Uk '-H—tH 
so that UkEkPk > for all k. Setting 

Ek = UkEk, 
it follows from Holder inequality's (42) that 

(G,E) £ = TrJ2 E kPk = J2 W E kPk\\i > l^ElEkPk = P succ (M) . (80) 



Using the fact that ||G|| £ = 1, the conclusion follows by dividing both sides by \\E\\ £ = P succ (M). 
■ 

Remark: Note that if one rescales G into generalized measurement, as is only possible when the 
index set is finite, then one obtains the "random guessing" measurement Gu = t./ -J\K\. The 
reader may therefore be surprised that the first iterate (which is unaffected by rescaling of G) 
corresponds to Holevo's asymptotically-optimal measurement, since this guess isn't even "smart" 
enough to take into account the a-priori probabilities of the 

The resolution of this paradox is that the unrescaled guess Gk = 1 does NOT correspond to 
random guessing: it guesses every value k. Equation 78 is therefore analogous to the statement 
that the teacher who grades a multiple-choice test by means of a punched overlay (with holes only 
for the correct answers) will give top marks to the daring schoolboy who marks ALL of the ovals 
on his exam. 
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3.3 A simple proof of the generalized Hole vo- Cur lander bounds 

We have assembled all the pieces necessary to apply Lemma 7: 

Proof of Theorem 15. Take V = Vg and S = Se to be as in Definition 28, and let G be given by 
(77). By Theorem 29, one has 

('G(+)) t G(+)=MQ w . (81) 

By Lemma 30 we may decompose M^ pt = (E°. pt ) £^ pt in such a way that the "small angle" 
estimate (79) holds in the equivalent form 

(E op \G) £ >P SU cc(M opt ) (82) 

given by (80). Replacing a; max by E^ pt in (13) gives 

v/P succ (A/°p*) > (MOW) > A (G) > (E°p\ G) £ , (83) 

where we have used the fact that ||G|| £ = 1. But by equations (14), (74), and (28), 



A(G)=TryXp|=A. 

The last inequality of (27) follows by appending (82) to (83). The remaining three inequalities of 
(27) follow by squaring (83). The inequality A < 1 follows by (27). ■ 

4 Maximum overlap as a maximal-seminorm problem 

The maximum overlap problem of equation 1 may be expressed as a maximal scminorm problem 
using the identity 

Tr {MchRk-^c (/4cw)) = \\V\^ M , (84) 
where U is a Stincspring dilation of 1Z and the seminorm is from the following definition: 

Definition 31 Let £ = C £ ® fCg be the the canonical environment (4$) ofTZ/c^c- For operators 
A, B : H — > £ <g> £, the ;U-M semi-inner product is defined by 

) ■ (85) 

The /i-M semi-inner product space is the space V^.m = $U : JC —> C£ ||C/|| M m < oo|, on 

which (•, •) M is well-defined. The purification ball is the set S — { U : JC — > ££ | \\U\\ < 1} C 
Vh,m, where ||«|| is £/ie operator-norm. 

4.1 Computation of directional iterates 

As in the case of measurements, it is not difficult to compute directional iterates: 

Theorem 32 (Directional iteration for maximum overlap is JFH iteration) Let V = V^,m 
and S be as in Definition 31. Then the operator Uic^cs € Vfi,M has the directional iterate 

Uj?_l L£ = Q{Q*Q)- 1/2+ , (86) 

where 

Qk^cs = Tr {MchUk^cs^ku) ■ (87) 

ri 

Furthermore, one has 

(U,U^) =||Q^ £ |li- (88) 
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Remark: Let Uteres be a Stinespring dilation of a CP map IZ/c^c- Then the operator U^\ C£ 
of the above Theorem is a dilation of the Jezek-Fiurasek-Hradil iterate [6, 75], already mentioned 
in section 1.4.2. 

Proof. By cyclicity of the trace and equations (43)- (44), 

W ™* £S ^(U,W)^ M = max Re (M CH W K ^ KH (tf^J 

= \\Qk^cs\\ x , (89) 
with maximizer W = CA + ) given by (86). ■ 

4.2 The restricted maximum-overlap problem 

The remainder of this work restricts consideration to the case that 

Men = \<I>)ch M ( 9 °) 
is a rank 1 projection, seeking to estimate 

MO (hkhAch) ■= sup (cficnl T^k^c (vich) \<ficu) > (91) 
n 

where the suprcmum is over quantum operations TZ from JC to C. For convenience, we denote 

{A K ^cs,B K ^cs)^ ■= (A B)^ m = Tr (Bjc-v^Mkw (A+J^J \<fi CH ) (92) 

and = V r AIj |0)( <?i |. 

It is worth mentioning that by Theorems 1 and 2 of [70] (see also equation 182 of the appendix), 
the minimum-error detection problem is a special case of the restricted maximum overlap prob- 
lem. The importance of this fact for this work is as follows: One may use the study of quantum 
measurements as a testing ground to for techniques for the study of the maximum overlap problem 
and its special cases, including quantum error recovery. Furthermore, as we have already seen, 
Barnum and Knill [14] considered channel reversibility in the sense of average entanglement fidelity 
by generalizing the "pretty good" measurement. 

4.2.1 A minor simplification 

We use the following notation for the partial traces of \<ficu)' 

<fic = ^j\4>)cn^\ (93) 
4>H=^\4>) cn (cfi\ ( 94 ) 

Using the identity 

\<f>cu) = n+ {(fin) \4>cn) , (95) 
where the positive projection IT + ((fin) is given by equation 45, one has the following 

Observation 33 One has the identity 

{<}>cu\TZk^c (m/cw) \4>ch) = (<t>cn\T^K-^c (P>kh) \<ficu) , (96) 
for any TZ, where fi is defined by 

pLKH = n + [<M x Wch x n + {(fin) . (97) 
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4.2.2 A "small angle" guess 

The strategy of Sec. 1.4 calls for construction of a guess G E V^ h< p subtending a provably small 
angle with some dilation W opt : JC — > C <£> £ of an optimal overlap operation 7?° pt . We will be most 
concerned with the "reasonably ovcrlappable" case, for which one has the crude approximation 

sup {(j)cu\ TtfcXc (A/ch) \<j>cu) « Tr (98) 
n 

Our choice of guess will therefore be motivated by the case in which exact equality is obtained: 

Proposition 34 (The perfectly overlappable case) Let \4>ch) be a unit vector and let 1Z be a 

quantum operation. Then one has perfect overlap 

(<Pch\TIk^c (A/ch) \<t>cu) = Tr/i (99) 

if and only if 

n^ K {\4>cn) (M) ^ . , = (100) 
where the adjoint TV is given by (^7). 
Proof. Since 

TV (\<f> £H ) (<Pch\) < TV (1) < 1, 
the conclusion follows from (47) and (43)- (44). ■ 

In section 3.2 we saw for finite ensembles that a properly-rescaled version of the "daring school- 
boy's" guess {Gfc = 1} could be implemented by "random guessing," without use of any mea- 
surement apparatus. This suggests consideration of a guess G/c->ce for which the corresponding 
(possibly trace-increasing) CP map 

TZ G (p) = Tr (GpGt) (101) 

is independent of p. 

The following Lemma shows that an analogue of equation 78 is satisfied by a guess of this kind: 

Lemma 35 (Construction of a guess) Let <j>cH be a unit vector and let Gic^cs be a dilation 
(101) of the (usually trace-increasing) CP map 

C £ (p)^^ 1+ xTr(p), (102) 

where £ is the canonical environment (48). Here we use the notation introduced in equations 18 
and 93. Then 



1. One has the identity 



2. If p and (f> are "perfectly overlappable" by a quantum operation TZ — TZ opt , as in equation 99, 
then TZ opt has a dilation W^ P _^ C( ^ £ such that 

\\G -W opt \\ ticj) = 0. (104) 



Remark: Note that the choice of dilation G docs not affect the operation 

fc<+> (p) :=TvG^p{G^y , 
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since the replacement G — > U £ ^ £ G)c~^c£i where U £ ^s is unitary, simply induces the replacement 
Proof. Equation 103 is trivial. To prove the equation 104, note that equations 47 and 103 imply 

<Ae-nc \<t>cn) (4>cu\ G k ^ C £ = (^ G )Lc (^«D = n + (<M ® ( 105 ) 

In particular, Gx-»x<8£ restricts to an isometry from ran(0%) ®K D ran(/i^^) into £. Let 

W)c^c£ ^ e a dilation of 7?. opt . By Proposition 99 it similarly follows that (4>cu \ ^Ic^cs 1S a ^ so an 
isometry on ran (/tjcw), implying that there exists a unitary such that 

Xe^e (4>cu \ W K ^ m£ = ((pcnl G K ^c®s 

on ran(/i/c%). Equation 104 follows from (92) by setting Wjc^cgg = 

4.2.3 Angle Estimates 

The following estimate shows that Gjc-^cs remains a reasonably-good guess when <fi and /i are only 
reasonably-overlappable, c.f. inequality 79: 

Lemma 36 (Angle estimates) Take <j>cu to be a unit vector, take the CP map Ttfc^c an ^ 
guess Gfc^c®£ to be as in Lemma 35, and let TZk^c be any quantum operation for which 

(4>ch\T^k^c (pku) \4>ch) > 0. 
Then 1Z has a Stinespring dilation W/c-^cs such that (W, G) ^ € M and 



cos {&) := -j— — t j-^-y — > . (106) 

II^IU H G IU \ TL^ K {\4>cn) {<t>cn\) Trfe) 

\ oo 

Here fitcu is given by {97) and the adjoint 7t) is from equation 47- Furthermore, one has the 
identities 

HG||^ = Tr(A^) (107) 



, G)^ = Tr y/(<l>\ CH p. 2 KH \ci>) cn (108) 
where G^ + > is the iterate of G given by Theorem 32. 

Remark: Note that cos (9) = 1 if perfect overlap (4>\ CH Hic->c (Ajch) l^cn = I^Ajch is achieved. 
Proof. Equation 107 follows from part 1 of Lemma 35: 

l|G||^ = (0£«|7^_ £ (£kh)\<I>ch) =Tr({K G y c ^ K (\<t>cn)(<l>cu\)ti>cu) = TrA™ 
To prove the angle estimate (106), note that one has the identity 

(W^ C£ ,G)^ = TrP £ ^ £ , (109) 

where 

Pe^e = {4>cu\ G K ^ C £»KH (W*)^^ \4>cu) ■ (HO) 

Starting with any dilation Wic^cs °f we may assure that Ps^s is positive scmidefinite by a 
replacement 
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where the unitary operator Xg^£ comes from the polar decomposition of Ps->£ . It follows that the 
LHS of (106) is real and maximized over the choice of dilation of 1Z. We claim that there exists a 
an operator Z : £ — > £ such that 



1/2 



\z. 



(111) 
(112) 



Assuming this claim, Holder's inequality (42) implies that 

1 



> 



Tr Z £ ^ £ P £ ^ £ 



n c ^{\M (4>cu\) 

The angle estimate (106) follows by dividing both sides by 

\<PCH 



1/2- 



(113) 



and by the square root of equation 107. 
To prove the claims (111) — (112), define 

Ze^e = {<t>au\ W K ^ae {&) C£ ^ K 
where £ is a copy of C. Then 



\4> 



CH) 



(114) 



I G K ^c£Wl, e 



(Gt) 



1/2 



TV, 



1/2 



But as in the proof of Lemma 35, the operator ((pen I G/c^cs is an isometry from K, ® ran (4>u) into 
£, proving (112). 

To prove (111), note that equations 114, 105, & 95 imply that 



Zs^s (4>£u\ G K ^c£ = {4>cu\ Wk^>ceR+ {4>u) = {4>ch\ Wic^C'S- 
Equation 111 follows, since by (110) 

Zs^sPs^S = Z £ ^ £ (4>C'U\ G K ^C'£ fJ-KU { W ^) C£^K \^Ch) 

It remains to prove equation 108. By equation 88, 

(G (+) ,G) =\\Q^ce\\ 1 , 

where by equations 87 and 90, 



(115) 



(116) 



Qic^cs = Tr(|</>£ W ) (</>£'«! G/c->c'£Wc-h) 

= {4>C'H\ G/C->£'£ \±KU \4>CU) i 

where £ is a copy of C. It follows by equations 105 and 95 that 

QPQ = {<Pch\^khX+ (<M M/cw Wch) 
= (<Pch\ itfcu \<t>cn) ■ 

The conclusion follows. ■ 



(117) 



(118) 
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4.2.4 The "Quadratic Overlapper" 

The following operation is analogous to the quadratically- weighted measurement: 

Definition 37 The "quadratic overlapper" is the operation 71°-° : B 1 (JC) — > B 1 (C) defined by 

lZQ°(v K )=TrG£\ C£ v K (G^)', (119) 

where Gfc^cs G V^rf * s ^e "small-angle guess" of Lemma 35, with directional iterate G^\ C£ given 
by equations 86-87 of Theorem 32. 

Alternatively, one may express IV* using 

Theorem 38 (Computation of the quadratic overlapper) One has the formula 

K%Zc (vk) = Tr (j? KH ((Y-^vY-^)^ ® \<j>) cn (0|)) , (120) 

where 

YK^K = {4>cu\tfc H \<t>cu), (121) 

and p,KM is given by (97). 



Proof. By equation 86, the guess G has the iterate 

^icXcs = Q [Q Q 



-1/2 + 

3 

where the operator Q is defined by (87). By equations (117)-(118), one has 

Qk^cs = (4>ch\ Gic^cs V-ku \4>ch) 
<3 T <2 = (4>cn\ VKH-yJCH \<t>C-H) = ^re- 
setting 

v K ^ = Y- 1 ' 2+ vY-V*\ (122) 

it follows that 

,(+) /'-./.At 



= Tr ((4>cu\ Gic^cs fJ-tcn \4>ch) vk^k. (4>ch\ Vkh { g ^)c's^k \^C'h)) 

= Jr [p KH |</>ch) v/c^ic (<f>cn\^KH ( g ^) C 'e^k (<l>C'n\Gic->c'e) (123) 

= Tr (pkh (vk^k ® |0) £W (</>!) /ijcwH+ (0w)) (124) 

rt 

= Tr (Ak«(^k®|0> £ „(^|)). (125) 

Here equation 123 uses cyclicity of the trace, equation 124 uses (105), and equation 125 uses (95), 
(97), and cyclicity of the trace. ■ 

4.2.5 Estimates for the restricted maximum overlap problem 

On now may apply our angle estimates in a manner similar to that of section 3.3: 

Theorem 39 (Two-sided estimates for the maximum overlap problem) Let pk/h be posi- 
tive semidefinite on 1C ®7i and let \4>ch) be a unit vector. Then 

A 2 

< (<t>cn\'R>ic-+c \<I>cu) < max (ipCH^K^C (pkh) \4>ch) 



< A x 



(K" pt )L K d^) V2 < A, (126) 
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where the maximum is over quantum operations 1Z : B 1 (/C) — > B 1 (C), where lZ opt attains this 
maximum, and where 

(127) 



A = Tr 

K 



Here picu is given by (97), V} is given by (47), and one interprets 2 /0 = 0. 
Remark: It follows from (126) that 



A<Tr(/ktt) x 



[n 



opt\t 



(\(/>cu) (<t>cu\) 



1/2 



< Tr/t ra . 



Note: Given an arbitrary invertible operator X : % — > "H, one may obtain potentially sharper 
estimates from inequality (126) using a replacement of the form 



PKH ~> \\Xu-YH<i> 'Ch\\ 2 (X 1 ) t /i/ C 'H^ L , 

which does not change the overlap (4>ch\ T^K-^c (pku) \4>cn)- 



(128) 
(129) 



Proof. Since all quantities in (126) scale linearly in p, set Tr fijcu = 1- (The case fi = is trivial.) 

Let 7l° pt attain the maximum in (126). Take Gjc^cs to be the "small angle" guess of Lemma 
35, with iterate G^2y^ £ given by Theorem 32. By Lemma 36, there exists a Stinespring dilation 
WfcUcs 01 Tt- opt such that the angle estimate (106) holds in the equivalent form 



(W op \G) 



> 



^chI^kXc (P-icn) \<Pcn) 



given by (113). But by lemma 7 and (107) 



|vF opt || > 



G (+) 



(K°*)Uic(\<t>cu)(<t>£n\) 
> A(G) > (W op \G) 



1/2 



where by (14) and (107) - (108), 

A(G)=Rc(G(+),G/||G||^) 



A. 



/<■<?■ 



(130) 



(131) 



(132) 



The third inequality of (126) follows by appending (130) to (131). Squaring the first three quantities 
of (131) proves the first two inequalities of (126). The final inequality follows from the fact that 
lZ opt is a quantum operation. ■ 

4.3 Estimates for quantum conditional min-entropy 

Theorem 39 has the following corollary: 

Corollary 40 Let Ha and "Kb be finite- dimensional, and let pab be a density matrix onH.A®H.B- 
Then for any s£M the conditional min-entropy (10) of A given B satisfies the bounds 



log 2 [y/Trfii x Tr JT±pabPa"pab) < H mm (A\B) p < - log 2 



Ti's \fTrA Pab Pa' * Pab 
^Pa S 

V J 

(133) 

where pa = Trg Pab- Here any non-positive powers are evaluated as in equation 18. (In particular 

p\ := n + (p A ) , (134) 
where the RHS is the positive projection (4-5).) 
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Remarks: 

1. The s — case of (133) is particularly simple: 

1 



log ( Tr ^Tr p\ B j - log (rank (p A )) < H min (A\B) p < -2 log 2 Tr ^Tr p\ B 



(135) 



2. If s = 1/2 and pab = IV'ab) (V'abI is pure then a simple calculation^ 113 ! shows that the upper 
and lower bounds of (133) agree, yielding the known [70] expression 

H min (A\B) h , ab) = -2 log 2 (Tr ^l) • (136) 



3. More generally, if pab = \^ab) {'4>ab\ is pure then the upper bound of (133) is exact and 
independent of s. 

4. If pab is a maximally-entangled pure state then the lower bound of (133) is also exact and 
independent of s. 

Proof. Let 

PA = \k) A (k\ 

be a spectral decomposition. Then by equation 12 one has the identity 

2 -H min (A\B) a _ 



where 



\^s) A A* 



AB 



" = ^ j s\aa* max(ft B _>A* (Vs)ab) U>s)aa* > 



A s/2 

k with A fe >0 V ±L "A 



a/2 -s/2 -s/2 

Pa Pa PabP a ■ 





AA* 




s/2 

Pa 


2 



(137) 

(138) 
(139) 



The bounds (133) follow by Theorem 39. 



Remark: Using the fourth term of inequality 126, one may tighten the lower bound of (133) in 
cases where one can estimate 

(^ opt )L^(i^)(^i) 

where lZ opt is a maximizer of (137). Appendix C shows that this works in the case that pab is a 
"quantum-classical" state. 



5 Approximate Channel Reversals 

This section applies Theorem 39 to estimate the reversibility of an arbitrary quantum operation 
A : B 1 (H) — s- B 1 (/C), as measured by entanglement fidelity max7j K ^ H F e (p,lZoA). (Note that 
more generally Theorem 39 gives estimates when the input state of A and target output state of 1Z 
differ, but we focus on this special case.) 

In order to express the our reversibility estimates in a more intuitive form (and to understand the 
relationship of the corresponding reversal with that of Barnum and Knill) , it is useful to introduce 
a method for applying functions to CP maps. 
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5.1 The p- functional calculus for CP maps 

One may tailor the Kraus decomposition [89] a CP map to a given input density matrix p: 

Definition 41 Let A : B 1 {%) — > B 1 (K.) be a completely-positive map and let p be a density matrix 
onT-L. A p-Kraus decomposition of the restriction of A to B 1 (supp(p)) is a decomposition of 
the form 

A 0) = ^lPkE k pEl, supp 0) C supp (p) (140) 

where 

1. The vectors Ek \ipp) G K <8> H* are orthonormal, where ip p is the purification (66). 

2. The pk are non-negative. 

3. The p-Kraus operators Ek : T~L — >• K, have supports in supp (p). 
Remarks: 

1. Existence of a p-Kraus decomposition follows by an easy modification of standard techniques. 
(See Proposition 43, below.) 

2. When A is trace-preserving, one interprets A as acting on p by randomly sending the purifica- 
tion \ip p ) HH * into one of the orthonormal vectors Ek \ip P ) 1 which are classically-distinguishable 
by the observer with access to KM* . 

3. By equations 66 and 61, the orthonormality of the Ek \ip P ) H - H * is equivalent to the condition 

Tr (ElE t p) = Ski- (141) 

If p is maximally-mixed one therefore obtains the usual orthogonality conditions [90] some- 
times required for the Kraus operators. 

Given a state p, there is a natural notion of applying functions to completely positive maps: 

Definition 42 (p-functional calculus for CP maps) Let A : B 1 (H) — > B 1 (JC) be a completely 
positive map with the p-Kraus decomposition (140). For f : [0,oo) — > [0, 00) one defines the CP 
map f p (A) : B 1 (supp (p)) -> B 1 (JC) by 

(f p (A))(p)=J2f(Pk)EkVEi (142) 

The quadratic reweighting A^ 2 ' p ^ of A corresponds to the case f (p) =p 2 : 

A^(u)=J2plE k »El (143) 

The following proposition shows that the CP maps f p (A) and A^ 2 '^ are well-defined and inde- 
pendent of the decomposition (140): 

Proposition 43 Let f , A, and p be as in Definition 42. Then 
1. A p-Kraus decomposition (140) of A exists. Furthermore, 

^2?k = TrA(p) , 
so that {pk} is a probability distribution if A is trace preserving. 
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2. One has the identity 

(f p (A)) (p) = Tr {{p- 1/2+ ^p- 1/2+ ) n ^ H , x / (A (|^W (Wl))) , (144) 

for all p £ B 1 (supp (p)), where barred product of operators acts on H* as indicated by equa- 
tions 56-60. Here one applies f to a self-adjoint operator using the functional calculus [86]: 
Given a spectral decomposition 

A = Y^htti, (145) 

one writes 

/(A) = ^/(A 4 )IT. ( 146 ) 

3. In particular, the CP map f p (A) is independent of the choice of decomposition (140), and 

A<™ (M)=£((? =i 7 5 W^ w ._ +w . x {A^ p ) nu ,m)) 2 )> (147) 
for p, € B 1 (supp (p)). 
Remarks: 

1. If / (p) = p is the identity function and if p is maximally mixed then equation 144 reduces to 
the usual procedure for recovering a channel from its Choi matrix (67). 

2. Equation (147) gives the form of A^ 2 '^ which appears when one applies Theorem 39 to obtain 
reversibility estimates. The transpose of p becomes a partial transpose (60) when A 1 - 2 '^ is 
applied to the state of a composite quantum system. 

3. A prescription for computing A^ 2 ' p ' from an arbitrary set of Kraus operators of A appears in 
the next section. 

Proof. By equations 62, 66, 18, and 45, 

^ Ife = k 17 V 1/2 V /2 ))^ = |n + (P)p 1/2 )) nm - IVvW ■ (148) 

It therefore follows by equation 68 that for densities p € B 1 (H) 



A(p) = TrA(\^) Hn . =Tt M 1/ V 1/2+ ^(l^) w «. M p-VaV' 3 . (149) 

ri. ri 

Taking a spectral decomposition 

■A (IW) (Vvl) = 5> \ F *))kh* , (150) 
it follows from equations 149, 62, and 64 that 

A(p) = g5>* h^ /2+ V /a )) KH , ((F k p- 1/2+ p 1/2 \ = J2PxF kP - 1/2+ Pp- 1/2+ Fl (151) 
Setting 

E k = F k p- 1/2+ (152) 

gives the desired p-Kraus decomposition, where the desired condition (141) follows from the or- 
thonormality of the \Fk)) £ K, ® H* using equation 61. 
If A is trace-preserving then 

1 = A {\i'p)nv.' H'p\) = Tr t ^PkFk \ip P ) nn , (ipp\ El = 

by the defining orthonormality condition on the {Ek}, proving that {pk} is a probability distribu- 
tion. 
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Now suppose that we are given an arbitrary p-Kraus decomposition (140) and that p £ B 1 (supp (p)). 
Note that since both sides of (144) are linear in p we may assume without loss of generality that p 
is positive semidefinite. Then by equations 68, 148, 146, 140, and cyclicity of the trace 



(p fc ) E k pE\ = Tr J2f (Pk) E k \%) nn . {^\E\ 



= Tr 
H* 

= Tr 

H* 



p-V2+ w -V2+ x/^fl^W (^1)) 



as desired. 



5.2 Quadratic quantum error recovery 

Theorem 44 Let A : B 1 (U) ->■ B 1 QC) be a quant um operation, and let p be a density matrix on 
% . Then one has the following bounds on the optimal entanglement fidelity of recovery 



A 2 

„ , f s <F R (p,K QR oA) < sup F e {p,KoA)<k, 
where the supremum is over quantum operations 1Z : B 1 (fC) — > B 1 {%), where 



A = Tr JAP'P) {p 2 )<TrA{p), 



(153) 



(154) 



where A^- 2 ' p,> is given by Definition 42 (see also equations 147 and 172, below), and where quadratic 
recovery operation is given by 



ki r (v) = p (a^) 1 ( ( P 2 )Y 1/2+ 



Here the adjoint (•) is from Definition 24- 



A (2 - p) 



-1/2 H 



(155) 



Remark: In the case that A is trace-preserving, one may plug the square of the last inequality of 
(153) into the first inequality (153), giving 



F e (p,Tl^ R oA) A 2 

T-i i J > ET7— p TV _ sup F e (p,TZoA). 

su Pk k ^„ F e [p,HoA) sup UK: ^ n F e (p,KoA) n K ^ n 



(156) 



In particular, both of the lower bounds of (153) are sufficiently tight to also satisfy the tightness 
relation (34) of Barnum and Knill [14]. (Note, however, that Barnum and Knill also produce esti- 
mates for average entanglement fidelity, under certain commutativity assumptions.) Furthermore, 
by expressing the bounds (153) in terms of the infidelity 1 — F e (p, 1Z a A) one obtains the fact that 
7?Q R , like 7Z BK , has an infidelity of recovery within a factor of two of the optimal. 

Proof. Let H ln be a copy of T-L, let IVvO-h-w* be the canonical purification (66) of p, and set 

to = Au in ^K. (lVv)« ln -H* (Vvl) ■ 

Using the replacements (%,£,£) — > (H* ,IC,H) and \<j>) CH — ^ \^ P ) HH * 1 Theorem 39 gives estimates 
of the form 

A 2 

< F e (p, o A) < sup F e (p, K o A) < A. (157) 

We claim that Tr p KH * = Tr A(p), A = A, and ll QO = TZ QK . 
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First claim: Note that 

= n+ (p H , ) Au^k ( \Vp)) Uln H* ((VP I ) n + iPw) 
= An^K (lVP n + (p)»« toW . ((Vpn+ 



A, 



(158) 
(159) 
(160) 
(161) 



where the first three equalities used (97) & (94). (69), and (62). It follows by equation 68 that 



£L Aor = Tr An in ^ic (Tr \ip 



KM 



H* 



"P'H in H 



TvA(p). 



as desired. 

Second claim: One computes 



(VV>I«W (Ak«*) 2 \^p)uw 

= ($p\ HH * (A-h^ic (\ip P ) n . nn , (V'pl)) \^ P ) UH . 

>K {\^p) n . nU , (Vvl)) \^p)hU* WV>I 



Tr 



A 



= Tr 

w 



(ah^k {\i>p) n . mn * (i'pij) 

-A [2 - p) (o 2 ) 



Pu> 



(162) 
(163) 

(164) 
(165) 



where Wi n is a copy of H and where our steps (in sequence) used (161), cyclicity of the trace, (69), 
and (147). That A = A now follows from equation 127. 
Third claim: By equation 120 



K 



QO 



0) = [(A/ch* ) 2 ((y-^vY-^)^®^)^, (v, p |) 



where by (121) and (165) 

Y K ^=A^ (p 2 ), 
But by (66) & (62), (147), and (65) one has 



(166) 
(167) 



Tr 



Tr 

n* 



An,, 



IV 



_-l/2+ 



'HH* 



_-l/2+ 



.4 



Six: ® a« 



PT (|p» 

B 2 ("H*)^B 2 ("H in ) 



TO* 



where is a copy olW. So setting 



-1/2 + 



2,P) 



-1/2 + 



(168) 



(169) 
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and substituting (168) into (166) gives 

Kk^c (v) = Tt {{fucu-f (X K ^ K ® \i> p ) nH , (Vp|) 
= Tr (Ti [[fiK,H* f \i>p) Hn - (Vvl) X K ^ K 
= Tr (Ug^ {pu^n m ® Pu^u) X 

K. \ 



Tr 

Win 



(ph^Qph^h) ((a^V (X) 



PU-^U ( (- 4(2,P) ) K _ >H . W ) 



(170) 



This proves the claim. 

The inequality A < Tr.4. (p) follows from (153). ■ 

The following proposition puts our recovery bounds into a form closer to the nearly simultaneously- 
appearing bounds of Beny and Oreshkov (Theorem 21, above): 

Proposition 45 Suppose that p is a density on T~L and that the quantum operation A : B 1 (H) — > 
B 1 (JC) has a Kraus decomposition of the form 



A( l i) = J2 F k^l> 



(171) 



where the are not constrained to satisfy any orthogonality conditions. Then for fi £ B 1 (supp (p)) 
one has 

a {2 - p) ( f i) = j2 F ^ iF i x Tr ( F i F ep) ( i72 ) 

u 

Proof. Since both sides of (172) are linear in fx, we may assume without loss of generality that p, 
is positive scmidcfmitc. The conclusion follows using equations 147, 148, 66 & 62, and 61 & 64 (in 
said order): 



A^ (p) = Tr ^Va p -i/a+ x (^F k |WW (^ P \F^ 
= £ ( E F * '^W W ^ IVvW <<M 



X p-V2+ M l/2 



5> fe pFt x Tr (i^p) . 

kl 



5.3 The relationship between the Quadratic Recovery and Barnum and 
Knill's reversal 

As we have already seen in equation 156, the quadratic reversal 1Z$ R and the simple lower bound 
of (153) are both sufficiently accurate to also satisfy the tightness relation (34) of Barnum and 
Knill. This section makes a brief comparison of the quadratic reversal operation with the reversal 
map of Barnum and Knill (for the special case of non-average entanglement fidelity) in light of the 
relationship between the quadratic measurement and the PGM. 
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Re-expressing the elements of the ensemble (2) as pk = PkPk, where Tr (p^) = 1 and pk = Tr pk 
is the chance that p>k appears, the formulas for the "pretty good" and quadratically-weighted 
measurements become 

M fe pGM = ($>/*)~ i/2+ pxp- (5>w)~ 1/2+ 

Mr - (z^i)~ 1/2+ pipt (x> 2 <f 1/2+ ■ 

In particular, to get from the pretty-good measurement to the quadratic measurement, one replaces 
all probabilities and density matrices by their squares. 

A simple examination of the formulas (33) and (155) shows that a similar relationship exists 
between the entanglement fidelity case of the Barnum-Knill reversal 1Z BK and the quadratically- 
weighted reversal 7?.^ R . Note that the corresponding probabilities pk, which must be replaced by 
their squares, are viewed as being hidden in the p-Kraus decomposition (140) of the reversed map 
A. 

In [83] various weightings for Bclavkin pure-state square-root measurements were compared, and 
it was argued that Holevo's quadratically-weighted measurement had qualitative and quantitative 
advantages over the linearly weighted PGM. Based on analogy, we conjecture that 1Z Q R will typically 
(but not always) outperform TZ BK . 

5.3.1 Depolarizing noise and the quadratic transpose channel 

It is perhaps interesting to quantitatively compare the actions of the Barnum-Knill reversal 1Z BK 
with the quadratic recovery 7?Q R in the simplest special case, in which depolarizing noise 

(p)=px ^-^Tr(/i) + (l-p)xp, pe[0,l] (173) 

acts on half of a maximally-entangled state. (Note that the p = 1/ dim'H case of 1Z BK , also known 
as the transpose channel [84] , has recently [20] been employed in the study of approximate quantum 
error correction.) For dimH > 1, one easily obtains 

Tl BK = Ap (174) 

^p R =- 4 /( P ,dim«), (175) 

where 

P 2 

f (p, dim«) := * (176) 

{l-pf {AimUf + {2-p)p 

satisfies 

/(p, dim ft) < f(p,l) =p 2 . (177) 

In particular, both recovery operations "correct" depolarization errors by committing further de- 
polarization. Fortunately, however, when p < 1 the quadratic recovery depolarizes with lower prob- 
ability than the transpose channel, especially when dim'H is large or p is small. 

A more detailed quantitative comparison of 7Z BK , 1ZQ R , and of reversals of other possible weight- 
ings (perhaps generalizing the cubically- weighted measurement of [91, 92]) will be left for future 
work. 



6 Conclusion and future directions 

We have generalized the iterative schemes of Jezek-Rehacck-Fiurasck [74], Jezek-Fiurasck-Hradil 
[6, 75], and Reimpell- Werner [7, 10]. Using an abstract framework, "small angle" guesses were 
employed to construct concise two-sided bounds for minimum-error quantum detection, maxi- 
mum overlap, quantum conditional min-cntropy, and the reversibility of quantum dynamics. An 
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approximately-optimal channel reversal and overlap operation were derived. The resulting bounds 
were sufficiently tight to also satisfy the tightness relations of Barnum and Knill [14], although 
our methods more generally allowed the target state and the input state to differ. Our recovery 
operation was found to be a significant improvement of the transpose channel in the simple case of 
depolarizing noise acting on half of a maximally-entangled state. 

As a direction for future study, we note that Barnum and Knill constructed an approximate 
reversal operation in the more general sense of average entanglement fidelity, albeit with commuta- 
tivity assumptions of unknown necessity. A remaining open question is whether one can generalize 
our quadratic reversal construction to this case of average entanglement fidelity, and whether these 
commutativity assumptions may be removed. More generally, one may ask how to obtain estimates 
for the maximum overlap problem without our assumed purity of the target state. The principle 
difficulty in answering both of these questions is in finding an appropriate "small angle guess," in 
the sense of lemma 7. 

Another future direction, in which we have made recent progress [93], is to employ matrix 
monotonicity to obtain bounds for the maximum overlap problem, including its special cases of 
channel reversibility and quantum conditional min-cntropy. 

Appendix A: Canonical Stinespring dilations 

Using only the square root function and the natural isomorphisms of Section 2.1, one may construct 
Stinespring dilations which are independent of any choice of a basis: 

Definition 46 Let TZ : B 1 (K.) — > B 1 (£) be a completely positive map, with K. finite-dimensional. 
The canonical environment is given by 



where C £ and K,£ are copies of C* and K,, respectively. The canonical Stinespring dilation 

Utz of 1Z is the linear transformation U-r : K, — > C ® £ such that \UTifl csk* = W1)ck*c*k * s ^ e 
canonical purification (66) of the Choi matrix TZ = 1Z( \1)) KK , ((A )- 

That U-jz is a bona fide purification of TZ follows from the following lemma: 

Lemma 47 Let 1Z : B 1 (K.) — > B 1 (C) be a quantum operation, with IC finite-dimensional. Then 
Uteres ^ a Stinespring dilation of 1Z iff |C/)) £g K « is a purification of the Choi matrix 1Z. 

Proof. Suppose that U dilates 1Z. Then by equation 62 



so \U)) purifies 1Z. 

Conversely, suppose that \U)) C£lc « purifies TZ, and let v G B 1 (K.) be a density matrix. Then by 
equations 64, 62, 66, and 68, 



£ = C* £ (g) JCe, 



(178) 



TZ = TrU^cs PI (Uk^ce) ] = Tr \U)) C£K , ((U\ , 



Tr (Uk^csvk {Uk^cs^ 



) 



Tr (V)^ W)) C£K , ((U\ (v 



f 1/2 



) 




(179) 
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Appendix B: Reimpell- Werner iteration as directional itera- 
tion 



The purpose of this section is to verify that Reimpell- Werner iteration (introduced in section 1.4.2) 
for CP maps corresponds to directional iteration of the corresponding Stinespring dilations. 
One may re-express the maximized functional / (TV) of equation (21) as 

f(K) = \\Un\\ 2 F , (180) 

where Un is a Stinespring dilation of the CP map 7Z : B 1 (JC) —> B 1 (C) and the seminorm is defined 
by 

Definition 48 Let £ = C* £ ® JCg be the canonical environment (178) for quantum operations from 
K, to C For operators U,W : JC C ® £ , define the semidefinite inner product 

(U,W) F = ((U\ CK£ , Fac*^cfc* \W)) CK£ , , (181) 

where A t— > |^4)) is the isomorphism of equation 58. Let Vf = {U \ \\U\\ F < oo}, on which 
is a well-defined semidefinite inner product. Let S ~ {U \ \\U\\ < 1}. 

Theorem 49 (Reimpell- Werner iteration is directional iteration) Suppose that Uyc^>C£ * s 
a Stinespring dilation of a CP map 1Z : B 1 (JC) — > B 1 (£). Then U £ Vf has a directional iterate 
£A + ) € S which dilates the Reimpell- Werner iterate TZ® of Def. 11. 

Proof. Let X/c^-cs be the operator defined by 

\Xk^ce)) = FcK'^CK* \U)) C£IC , ■ 

Then by equations 181, 43, and 44, 

max Re {W,U) F = maxRcTr (W K ^ C£ )^ X K ^ C£ = WX^ceh, 

with maximizer W = given by 

c/(+) = x(x^xy 1/2+ . 

By equations 62, 63, and 23 one has 



= (TcF LK ^ CK * \U)) C£K , ((U\ F CK ^ CK ,) V * F CK ,^ CK , \U)) C£K , 



T~ 1/2+ Fck^ck-Uk^ce II) 



It follows from Lemma 47 and Eq. 22 that U {+) dilates 7£ 9 
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Appendix C: The relationship between overlap bounds and 
state distinguishability 

As remarked in section 4.2, Theorems 1 and 2 of [70] (see also equations 186-189, below) imply that 
minimum-error distinguishability of a finite collection of quantum states £ = {pk}u = i m may be 
expressed in terms of restricted maximum overlap: 

f S ucc (M opt ) = m x max (fo™ (C m). | TL n ^ {C my (m«c™) |c?W"») • (182) 

Here the vector </> <E C m <£> (C m )* is the maximally- mixed state 

kwr) = |1>W- := l fc )c- |*>c»» , (183) 

and fj, G B 1 (H<g) C m ) is the "quantum-classical" state 

m 

M«®c- = <8 |fc) Cm , (184) 

fe=i 

where the pk are normalized as in Definition 1. 

If one applies the overlap bounds of Theorem 39 (or the s = case of Corollary 40 combined 
with Eq. 12), one obtains 



Tr JJ^pI < ^ucc (M QW ) < Psncc (M opt ) < (7e opt ) t (|«# (</>!) x Tr J^A 



< ^xTr^>|. (185) 

In particular, if one neglects the factor in the fourth expression of this estimate then one 

picks up a spurious factor of yfm not appearing in the bounds of Theorem 15. (Weakness of the 
upper bound is not surprising, since <j) and \x are generally not "reasonably overlappable.") 

In order to show how one may apply the fourth term of the overlap estimate (185), we give 
another proof of Theorem 15. It is hoped that similar methods may lead to sharper upper in other 
instances of maximum overlap or conditional min-entropy. 

An "overlap proof of Theorem 15. We restrict consideration to the case £ = . 
Given a quantum operation IZ^^^m one has the identity 

to x (0 C ™(C™)*| ft«-KC">)* (Mwc0 |0c™c"») = fsucc (M n ) , (186) 

where p £ B 1 (H (3 C m ) and (j> £ C m <g> C m * are as in equations (183)-(184) and where the POVM 
M TC corresponding to the operation 1Z is given by 

Mf := {Tl)\ cmr ^ n (|fc) cm , (*|) , k = l,...,m. (187) 

Since any given POVM M may be expressed in the form of (187) for the quantum operation 
1Z = 1Z M given by 

m 

V) := E l £ >c- (*l x Tr ( M ^) . ( 188 ) 
fc=i 

maximization of (186) over operations 1Z gives the identity (182). Taking M opt to be some optimal 
measurement, it follows that a maximizer of the LHS of (186) is given by 



1Z opt = K M ° vt , (189) 
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where M opt is an optimal measurement. One estimates 




(|<^) cmcm , (0|) 



1 



$>) Cm <fc| ® M fc ° 



opt 



1 

< — . 

171 



(190) 



OO 



m 



oo 



Applying the bounds (126) to (186) yields the chain of inequalities 




(191) 



Acknowledgements: We would like to thank Arthur Jaffe, Peter Shor, and Chris King for their 
encouragement, Arthur Jaffe for suggesting a change in presentation, Andrew Fletcher, Frederic 
Dupuis, Cedric Beny, Ognyan Oreshkov & Renato Renner for valuable discussions, Stephanie 
Wchncr for pointing out the work of Ogawa and Nagaoka, the anonymous referee for valuable 
comments, and a previous referee for [22] for pointing out the connection to min-entropy. 

Note Added: Private communication from the authors of [18] indicates that they have obtained 
the quadratic recovery channel by alternative means [19]. 



References 

[1] A. S. Holevo, "Remarks on optimal measurements," Problems of Information Transmission 
10, no. 4 317-320 (1974); Translated from Problemy Pcredachi Informatsii, 10 no. 4, 51-55 
(1974). 

[2] H. P. Yuen, R. S. Kennedy, and M. Lax, "Optimum testing of multiple hypotheses in quantum 
detection theory," IEEE Trans. Inf. Theory, IT-21, 125 (1975). 

[3] A. S. Holevo, "Statistical Decision Theory for Quantum Systems," Journal of Multivariate 
Analysis 3, 337-394 (1973). 

[4] S. M. Barnctt and S. Croke, "On the conditions for discrimination between quantum states 
with minimum error," J. Phys. A: Math. Theor. 42 062001 (2009); e-print arXiv:0810.1919. 
Note: The presented argument becomes even shorter if one uses Holevo's [1] simplified con- 
ditions for optimal measurements, which are precisely given by the positivity of the operators 
Gj of equation (10). 

[5] B. Schumacher and M. Westmoreland, "Approximate quantum error correction," Quantum 
Information Processing 1, 5-12 (2002); arXiv:quant-ph/0112106. 

[6] M. Jezek, J. Fiurasek, and Z. Hradil, "Quantum inference of states and processes," Physical 
Review A 68, 012305 (2003); quant-ph/0210146. 

[7] M. Rcimpell and R. F. Werner, "Iterative optimization of quantum error correcting codes," 
Phys. Rev. Lett. 94, 080501 (2005); e-print arXiv:quant-ph/0307138. Note: The authors 
were unaware of the previous work of [6]. Furthermore, despite much suggestive languange, 
Rcimpell and Werner do NOT prove that their iterates converge when the encoder or decoder 
is held fixed. (See [10], below.) 

[8] A. S. Fletcher, P. W. Shor, and M. Z. Win, "Fletcher Shor Win Optimum quantum error 
recovery using semidefinite programming," Phys. Rev A 75, 012338 (2007); e-print arXiv: 
quant-ph/0606035. 

[9] M. Rcimpell, R. F. Werner, and K. Audcnacrt, "Comment on 'Optimum quantum error 
recovery using semidefinite programming," e-print arXiv:quant-ph/0606059. 



35 



[10] M. Reimpell, "Quantum information and convex optimization," Ph.D. The- 
sis, Braunschweig, Technishe Universitat, 2007, urn:nbn:de:gbv:084-17795; 
http://deposit.ddb.de/cgi-bin/dokserv?idn=988217317. Note: The reported "stability" 
result of Theorem 3.2.13 is not by itself sufficient to prove that the channel power iteration 
converges to any limit, even when started in an arbitrarily small neighborhood of the global 
maximum. For example, the mapping / (x) = x + x 3 on tt satisfies Rcimpell's definition of 
"stable" about the fixed point x = 0, since ||/'(0)|| < 1. Note, however, that the iterative 
sequence x, f (x) , / (/ (x)) , . . . diverges unless x = 0. Furthermore, consideration of iterated 
rotations of the unit ball in R 2 shows that an assumption of compactness is no remedy. 

[11] A. S. Fletcher, P.W. Shor, and M. Z. Win, "Channel- Adapted Quantum Error Correction for 
the Amplitude Damping Channel," IEEE Trans. Inf. Theory, 54 5705-5718 (2008); e-print 
arXiv:0710.1052. 

[12] R. Kosut, A. Shabani, D. Lidar, "Robust quantum error correction via convex optimization," 
Physical Review Letters 100, 020502 (2008); quant-ph/0703274. 

[13] S. Taghavi, R. L. Kosut, and D. A. Lidar, "Channel-Optimized Quantum Error Correction," 
e-print arXiv:0810.2524 (2008). 

[14] H. Barnum and E. Knill, "Reversing quantum dynamics with near-optimal quantum and 
classical fidelity," J. Math. Phys. 43, 2097 (2002); e-print arXiv: quant-ph/0004088. Note: 
See [108]. 

[15] A. S. Fletcher, "Channel- Adapted Quantum Error Correction," Ph. D. Thesis MIT Cam- 
bridge, MA 2007; e-print arXiv: 0706.3400. 

[16] A. S. Fletcher, P. W. Shor, and M. Z. Win, "Structured near-optimal channel-adapted quan- 
tum error correction," Phys Rev A 77, 012320 (2008); c-print arXiv: 0708.3658. 

[17] Naoki Yamamoto, Shinji Hara, and Koji Tsumura, "Suboptimal quantum-error-correcting 
procedure based on semidefinite programming," Physical Review A 71, 022322 (2005); 
quant-ph/0606105. 

[18] C. Bcny and O. Orcshkov, "General conditions for approximate quantum error correc- 
tion and near-optimal recovery channels," Physical Review Letters 104, 120501 (2010); 
arXiv:0907.5391. Note: The term "fidelity" corresponds to the square root of the fidelity 
as defined above. The statement "However, we will show that when M. is a projection (which 
is the case for error correction)..." before Corollary 3 should read ". . . (as may be taken to be 
the case for error correction by setting Ai (p) = a x Tr p for some density a in the domain of 
M = U)." 

[19] C. Beny and O. Oreshkov, in preparation. 

[20] Hui Khoon Ng and Prabha Mandayam, "A simple approach to approximate quantum error 
correction based on the transpose channel," E-print arxiv:0909.0931. 

[21] R. Renner, "Security of quantum key distribution," PhD Thesis, ETH Zurich (2005); 
quant-ph/0512258. 

[22] Jon Tyson, "Two-sided estimates of minimum-error distinguishability of mixed quan- 
tum states via generalized Holevo-Curlander bounds," J. Math. Phys. 50, 032106 (2009); 
arXiv:0907.2094. Note: The author was unaware that the presented upper bound on dis- 
tinguishability was a special case of a bound in [23]. This was pointed out in the erratum 
[94]. 

[23] T. Ogawa and H. Nagaoka, "Strong converse to the quantum coding theorem," IEEE Trans- 
actions on Information Theory 45, 2486-2489 (1999); quant-ph/9808063. 



36 



[24] C. W. Helstrom, Quantum Detection and Estimation Theory (Academic, New York 1976). 

[25] P. Hausladcn, R. Josza, B. Schumacher, M. Westmoreland, and W. K. Wootters, "Classical 
information capacity of a quantum channel," Phys Rev A 54, 1869 (1996). 

[26] B. Schumacher and M. D. Westmoreland, "Sending classical information via noisy quantum 
channels," Phys Rev A 56, 131 (1997). 

[27] A. S. Holevo, "The capacity of the quantum channel with general signal states," IEEE Trans. 
Inf. Theory 44, 269 (1998). 

[28] L. Ip, "Shor's algorithm is optimal," http://lawrenceip.com/papers/hspsdpabstract.html 
(2003). 

[29] D. Bacon, A. M. Childs, and W. van Dam, "Optimal measurements for the dihedral hid- 
den subgroup problem," Chicago J. of Theoret. Comput. Sci. 2006, (2006); e- print arXiv: 
quant-ph/0501044. 

[30] D. Bacon, A. M. Childs, and W. van Dam, "From optimal measurement to efficient quantum 
algorithms for the hidden subgroup problem over scmidirect product groups," Proceedings of 
the 4.6th IEEE Symp. Foundations of Computer Science, (IEEE, Los Alamitos, CA, 2005), 
pp. 469-478 (2005). 

[31] A. M. Childs and W. van Dam, "Quantum algorithm for a generalized hidden shift 
problem," Proceedings of the 18th ACM-SIAM Symp. Discrete Algorithms, (Society for 
Industrial and Applied Mathematics, Philadelphia, PA, 2007), pp. 1225-1234; e-print 
arXiv:quant-ph/0507190. 

[32] M. Hayashi, A. Kawachi, and H. Kobayashi, "Quantum measurements for Hidden Subgroup 
Problems with Optimal Sample Complexity," Quantum Inform, and Compu. 8, 0345 (2008) 
; e-print arXiv:quant-ph/0604174. 

[33] C. Moore and A. Russell, "For Distinguishing Hidden Subgroups, the Pretty Good Mea- 
surement is as Good as it Gets," Quantum Inform. Compu. 7, 752 (2007); e-print 
arXiv:quant-ph/0501177. 

[34] D. Bacon and T. Decker, "The optimal single-copy measurement for the hidden-subgroup 
problem," Phys. Rev. A 77, 032335 (2008); e-print arXiv:0706.4478. 

[35] J. Radhakrishnan, M. Rotteler, and P. Sen, "Random measurement bases, quantum state 
distinction and applications to the Hidden Subgroup Problem," Algorithmica 55, 490-516 
(2009). 

[36] Markus Hunziker, David A. Meyer, Jihun Park, James Pommersheim, and Mitch Rothstein, 
"The geometry of quantum learning," Quantum Information Processing 9, 321-341 (2010); 
http: / / arxiv.org/abs/quant-ph/0309059. 

[37] V. P. Belavkin and V. Maslov, "Design of Optimal Dynamic Analyzer: Mathematical Aspects 
of Wave Pattern Recognition" In Mathematical Aspects of Computer Engineering, edited by 
V. Maslov, pp. 146-237 (Mir, Moscow 1987); e-print arXiv:quant-ph/0412031. Note: The 
first two equations on page 40 should be F, = H t (L°)~ 1/2 and M° = (L°) +1/2 D° (L°) +1/2 . 

[38] P. Hayden, D. Leung, and G. Smith, "Multiparty data hiding of quantum information," Phys 
Rev A 71, 062339 (2005); quant-ph/0407152. 

[39] A. Montanaro, "On the distinguishability of random quantum states," Commun. Math. Phys. 
273, 619 (2007); quant-ph/0607011. 

[40] D. Qiu, "Minimum-error discrimination between mixed quantum states," Phys Rev A 77, 
012328 (2008); arXiv:0707.3970. 



37 



[41] A. Montanaro, "A lower bound on the probability of error in quantum state discrimination," 
Proc. IEEE Information Theory Workshop 2008, pp. 378-380; e-print arXiv:0711.2012. 

[42] D. Qiu and L. Li, "Minimum-error discrimination of quantum states: Bounds and compar- 
isons," Physical Review A 81, 042329 (2010). 

[43] A. S. Kholcvo, "On asymptotically optimal hypothesis testing in quantum statistics," Theor. 
Probab. Appl. 23 411 (1978). Note: The displayed equation between (8) and (9) should 
be VVj \\tp 3 ;- ejf = 2 (1 - RcTr (UUT 1 ^ 2 )). The line just after equation (9) should 

i 

read "where V* = (nr 1 / 2 ) {liY 1 ' 2 )" The final expression in the paper should be 

2(i-Tr|r 1 / 2 n|). 

[44] P. J. Curlander, "Quantum Limitations on Communication Systems," Ph. D. Thesis, MIT 
Cambridge, MA 1979. 

[45] M. A. Nielsen and I. L. Chuang, Quantum computation and Quantum information (Cam- 
bridge: Cambridge university press 2000). 

[46] Aram Harrow and Andreas Winter, "How many copies are needed for state discrimination," 
quant-ph/0606131. 

[47] O. Morgenstern and J. von Neumann, Theory of Games and Economic Behavior, Princeton, 
1944. 

[48] D. Aharonov and M. Ben-Or, Proc. 29th Annual ACM Symposium on Theory of Computing, 
p. 176. (New York, ACM 1997). 

[49] A. Kitaev, "Quantum computations: algorithms and error correction," Russian Mathematical 
Surveys 52, 1191-1249 (1997). 

[50] E. Knill, R. Laflamme, W. H. Zurek, "Resilient quantum computation: error models and 
thresholds," Proceedings of the Royal Society of London, Scr. A 454, 365-384 (1998); 
quant-ph/9702058. 

[51] P. Aliferis, D. Gottesman, and J. Preskill, "Quantum accuracy threshold for concatenated 
distance-3 code," Quant. Inf. Comput. 6, 97-165 (2006); quant-ph/0504218. 

[52] D. Aharanov, A. Kitaev, and J. Preskill, "Fault tolerant quantum computation with long- 
range correlated noise," Physical Review Letters 96, 050504 (2006). 

[53] P. Shor, "Scheme for reducing decoherence in quantum computer memory," Physical Review 
A 52, R2493 (1995). 

[54] A. M. Steane, "Error correcting codes in quantum theory," Physical Review Letters 77, 793- 
797 (1996). 

[55] A. R. Caldcrbank and P. Shor, "Good quantum error-correcting codes exist," Physical Re- 
view A 54, 1098-1105. 

[56] E. Knill and R. Laflamme, "Theory of quantum error-correcting codes," Physical Review A 
55, 900-911 (1997). 

[57] C. Bennett, D. DiVincenzo, J. Smolin, and W. Wootters, "Mixed-state entanglement and 
quantum error correction," Physical Review A 54, 3824-3851 (1996). 

[58] R. Laflamme, C. Miquel, J. Paz, W. Zurek, "Perfect Quantum Error Correcting Code," Phys- 
ical Review Letters 77, 198-201 (1996). 



38 



[59] D. Leung, M. A. Nielsen, I. Chuang, and Y. Yamamoto, "Approximate quantum error correc- 
tion can lead to better codes," Physical Review A 56, 2567-2573 (1997); quant-ph/9704002. 

[60] Claude Crepeau, Daniel Gottesman, and Adam Smith, "Approximate quantum error- 
correcting codes and secret sharing schemes," in Advances in Cryptology - EUROCRYPT 
2005, Lecture Notes in Computer Science 3494, pp. 285-301 (2005). 

[61] B. Schumacher, "Sending entanglement through noisy quantum channels," Physical Review 
A 54, 2614-2628 (1996). 

[62] D. Kretschmann, D. Schlingemann, and R. Werner, "The information-disturbance tradeoff 
and the continuity of Stincspring's representation," IEEE transactions on information theory 
54 #4 1708-1717 (2008); quant-ph/0605009. Note: Some ambiguity in the introduction is 
cleared up by Lemma 2.4 of [95]. 

[63] C. Schaffner, B. Terhal, and S. Wehner, "Robust cryptography in the noisy-quantum-storagc 
model," Quantum Information and Computation 9, 963-996 (2009); arXiv:0807.1333 

[64] R. Konig, S. Wehner, and J. Wullschleger, "Unconditional security from noisy quantum stor- 
age," arXiv:0906.1030. 

[65] R. Rcnner, Extracting classical randomness in a quantum world, IEEE Information Theory 
Workshop, 2008. DOI: 10.1109/ITW.2008.4578686. 

[66] C. Schaffner, "Cryptography in the bounded-quantum-storagc model," PhD. Thesis, Univer- 
sity of Aarhus (2007); arXiv:0709.0289. 

[67] A. Leverrier, E. Karpov, P. Grangier, N. Cerf, "Unconditional security of continuous-variable 
quantum key distribution," arXiv:0809.2252. 

[68] R. Konig and R. Renncr, "Sampling of min-entropy relative to quantum knowledge" (2007) 
arXiv:0712.4291. 

[69] R. Renner, S. Wolf, J. Wullschleger, "The single-serving channel capacity," Proceedings 
of 2006 IEEE International Symposium on Information Theory (2006), pp. 1424-1427; 
arXiv:cs/0608018. 

[70] R. Konig, R. Renner, and C. Schaffner, "The operational meaning of min- and max-entropy," 
IEEE Transactions on Information Theory 55, 4337-4346 (2009); e-print arXiv:0807.1338. 

[71] S. Wehner, M. Christandl, and A. Dohcrty, "A lower bound on the dimension of a quantum 
system given measured data," Physical Review A 78, 062112 (2008); arxiv.org:0808.3960. 

[72] Mario Berta, "Single-shot quantum state merging," Ph. D. Thesis ETH Zurich 2008; e-print 
arXiv: 0912.4495. 

[73] Mario Berta, Matthias Christandl, and Renato Renner, "A conceptually simple proof of the 
quantum reverse Shannon Theorem," E-Print arxiv:0912.3805 (2009). 

[74] M. Jezek, J. Rehacek, and J. Fiurasek, "Finding optimal strategies for minimum-error quan- 
tum state discrimination," Phys. Rev. A 65, 060301 (2002); quant-ph/0201109. 

[75] Z. Hradil, J. Rehacek, J. Fiurasek, and M. Jezek, "Maximum-Likelihood Methods in Quantum 
Mechanics," Lecture Notes in Physics 649, pp. 59-112 (2004). 

[76] J. Fiurasek, and Z. Hradil, "Maximum-likelihood estimation of quantum processes," Physical 
Review A 63, 020101(R) (2001). 

[77] M. Sacchi, "Maximum-likelihood reconstruction of completely positive maps," Physical Re- 
view A 63, 054104 (2001). 



39 



[78] M. Mohseni, A. Rezakhani, and D. Lidar, "Quantum process tomography: resource analysis 
of diferent strategies," Phys. Rev. A 77, 032322 (2008); quant-ph/0702131. 

[79] V. P. Bclavkin, "Optimal distinction of non-orthogonal quantum signals," Radio Eng. Elec- 
tron. Phys., 20, 39 (1975). 

[80] V. P. Bclavkin, "Optimal multiple quantum statistical hypothesis testing." Stochastics 1, 315 
(1975). Note: Inequality 4.3 in the statement of Theorem 5 is backwards. 

[81] P. Hausladen, "On the Quantum Mechanical Channel Capacity as a Function of the Density 
Matrix," B. A. Thesis, Williams College, Williamstown, Massachusetts 1993. 

[82] P. Hausladen and W. K. Wootters, "A 'pretty good' measurement for distinguishing quantum 
states," J Mod Optic 41, 2385 (1994). 

[83] J. Tyson, "Error rates of Belavkin weighted quantum measurements and a converse to Holevo's 
asymptotic optimality Theorem," Physical Review A 79, 032343 (2009); arxiv.org:0907.1884. 

[84] M. Ohya and D. Petz, Quantum Entropy and its Use, Texts and Monographs in Physics, 
Springer Verlag, Berlin (1993). 

[85] Ky Fan, "Minimax Theorems," Proceedings of the National Academy of Sciences 39, 42-47 
(1953). 

[86] M. Reed and B. Simon, Methods of Modern Mathematical Physics I: Functional Analysis 
("Academic, New York, 1980). 

[87] W. Forrest Stinespring, "Positive functions on C-* Algebras," Proceedings of the American 
Mathematical Society 6, 211-216 (1955). 

[88] M. D. Choi, "Completely positive linear maps on complex matrices," Linear Algebra and its 
Applications 10, 285-290 (1975). 

[89] K. Kraus, States, effects, and operations (Springer- Verlag, Berlin, 1983). 

[90] P. Arrighi and C. Patricot, "On quantum operations as quantum states," Annals of Physics 
311, pp. 26-52 (2004); quant-ph/0307024. 

[91] Stephanie Wehner, "Cryptography in a quantum world," PhD Thesis: University of Amster- 
dam (2008); arxiv:0806.3483. 

[92] Manuel A. Ballester, Stephanie Wehner, and Andreas Winter, "State discrimination with 
post-measurement information," IEEE Transactions on Information Theory 54, 4183-4198; 
quant-ph/0608014. 

[93] J. Tyson, In preparation. 

[94] J. Tyson, "Erratum: 'Minimum-error quantum distinguishability bounds from matrix mono- 
tone functions: A comment on 'Two-sided estimates of minimum-error distinguishability of 
mixed quantum states via generalized Holevo-Curlander bounds'," J. Math. Phys 50, 109902 
(2009). 

[95] B. Rosgen and J. Watrous, "On the hardness of distinguishing mixed-state quantum com- 
putations," Proceedings of the 20th annual IEEE conference on computational complexity, 
344-354, June 11-15, 2005; arXiv:cs/0407056. 

[96] Yonina C. Eldar, Alexandre Megretski, and George C. Vcrghcsc, "Designing Optimal Quan- 
tum Detectors Via Scmidcfinitc Programming," IEEE Transactions on Information Theory, 
Vol 49 #4, pp. 1007-1012 (2003). Note: The reported implementation appears to have mild 
numerical innaccuracies in the case of optimal measurement operators which are identically 



40 



zero. In particular, the numerical example reported in equation 40 has the unique exact so- 
lution m = (0, 0), m = N^ 1 ((1 + V5) /2, 1), and ^3 = ^ 3 _1 ((l - VS) A l), where 7V 2 , 3 
are normalization factors. (The identity = follows from the invcrtibility of L — p\.) 

Furthermore, the results concerning the ranks of optimal measurement operators had already 
been reported in [80]. 

C. W. Hclstrom, "Bayes-Cost reduction algorithm in quantum hypothesis testing," IEEE 
Trans. Inf. Theory IT-28, 359-366 (1982). Note: Ref. [74] asserts that the presented algo- 
rithm docs not always converge to an optimal measurement. 

J. Tyson, "Estimates of non-optimality of quantum measurements and a simple iterative 
method for computing optimal measurements," arXiv:0902.0395. Note: Convergence of this 
algorithm is quite slow. 

J. Benedetto and A. Kebo, "The role of frame force in quantum detection," Journal of Fourier 
Analysis and Applications 14, 443-474 (2008). 

J. Baez and J. Huerta, "The algebra of grand unified theories," Bull. Amer. Math. Soc. 47, 
483-552 (2010); E-print arXiv:0904.1556. 

H. Barnum, C. Caves, C. Fuchs, R. Josza, and B. Schumacher, "On quantum coding for 
ensembles of mixed states," J. Phys. A: Math. Gen 34 6767-6785 (2001); quant-ph/0008024. 

Andreas Winter, "'Extrinsic' and 'Intrinsic' Data in Quantum Measurements: Asymptotic 
Convex Decomposition of Positive Operator Valued Measures," Communications in Mathe- 
matical Physics 244, 157-185 (2004); quant-ph/0109050. 

Y. C. Eldar and G. D. Forney, "On quantum detection and the square-root measurement," 
IEEE Trans. Inf. Theory 47, 858 (2001); e-print arXiv:quant-ph/0005132. Note: The authors 
were unaware of [43]. 

A. S. Holcvo, "Remarks on the Choi-Jamiolkowski Correspondence in Infinite Dimensions," 
E-Print arXiv:1004.0196 (2010). 

One usually requires that a POVM satisfy J^Mj, = 1. The relaxed condition J^Mfe < 1 allows 
the possibility that the POVM may fail to return an result. (Alternatively, one may augment 
{Mfc} with the operator M orr0 r = 1 — J^Mfc. The operator M error could perhaps serve as a 
useful error flag, i.e. an indication that a state orthogonal to span(£ ) has been detected.) 

Other numerical methods for computing optimal measurements exist [96-99]. 

The maximum-overlap problem (1) is equivalent to the one-data-point special case of 
maximum-likelihood quantum process tomography. The specific iteration is defined by equa- 
tions 14 through 17 of [6]. 

Barnum and Knill incorrectly assert on page 2103 of [14] that the asymptotically-optimal 
measurement (24) introduced by Holevo in [43] is equal to the "pretty good" measurement 
(29). 

Indeed, in Quantum Field Theory it is natural to use dual states ip £ %* to represent an- 
tiparticles. See, for example, [100]. 

Basis-dependent versions may be found in [101, 102]. 

See [104] for the infinite-dimensional case. 



41 



[112] Indeed, if E = {pk \ipk) (V'fel} is an ensemble of linearly-independent pure states tpk spanning 
% then the maximizer E — G^ of Re (G, E) £ is of the form Ej. — \ipkj (efe|, with {e^} 
orthornormalJ 114 ! One may therefore express 
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e {e k } v ^ ' W> 



-vHolcvo 



({e*}), 



where optimization is over orthonormal {ek}- Here C Holevo = g^Vfc \\ipk — efc|| 2 is the "ap- 
proximate cost function" which was minimized by Holevo ([43], equation 8) in his construction 
of M^ w in the case of pure states. (See also [22, 103].) 

I — 1/2 / — 1/2 

[113] One needs the identity Tr B \J Trx PabP a Pab = \J(^ab\p a \i>AB) x 
Tr B y/TrA \^ab) {^ab\ = \f^A yfpt X Tr B ^fp~B = (Tr y/pX) 3 ^ ■ 
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normality of the follows from the inequalities ||efc|| 2 < \ee) (ei\ = 1 and the 

identity ||e^ || 2 = Tr \ e?) (e^| = dim'H. Orthogonality then follows from the identity 

M(£k)( efc |)M = IM 2 . 
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